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(54) Nucleic acid integration in eukaryotes 

(57) The invention relate to the field of molecular bi- 
ology and cell biology. It particularly relates to methods 
to direct integration of a nucleic acid of interest towards 
homologous recombination and uses thereof. The 
present invention discloses factors involved in integra- 
tion of a nucleic acid by illegitimate recombination which 
provides a method to direct integration of a nucleic acid 
of interest to a pre-determined site, whereby said nucle- 
ic acid has homology at or around the said pre-deter- 



mined site, in an eukaryote with a preference for non- 
homologous recombination comprising steering an inte- 
gration pathway towards homologous recombination. 
Furthermore, the invention provides a method to direct 
integration of a nucleic acid of interest to a sub-telomeric 
and/or telomeric region in an eukaryote with a prefer- 
ence for non-homologous recombination. 
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Description 

[0001] The invention relates to the field of molecular biology and cell biology. It particularly relates to methods to 
direct integration towards homologous recombination and uses thereof. Several methods are known to transfer nucleic 

5 acids to, in particular, eukaryotic cells. In some methods the nucleic acid of interest is transferred to the cytoplasm of 
the cell, in some the nucleic acid of interest is integrated into the genome of the host. Many different vehicles for transfer 
of the nucleic acid are known. For different kinds of cells, different systems can be used, although many systems are 
more widely applicable than just a certain kind of cells. In plants, e.g., a system based on Agrobacterium tumefaciens 
is often applied. This system is one of the systems that can be used according to the invention. 

io [0002] The soil bacterium Agrobacterium tumefaciens is able to transfer part of its tumor-inducing (Ti) plasmid, the 
transferred (T-) DNA, to plant cells. This results in crown gall tumor formation on plants due to expression of onc-genes, 
which are present on the T-DNA. Virulence (vir) genes, located elsewhere on the Ti-plasmid, mediate T-DNA transfer 
to the plant cell. Some Vir proteins accompany the T-DNA during its transfer to the plant ceil to protect the T-DNA and 
to mediate its transfer to the plant nucleus. Once in the plant nucleus, the T-DNA is integrated at a random position 

*5 into the plant genome (reviewed by Hooykaas and Beijersbergen 1994, Hansen and Chilton 1999). Removal of the 
onc-genes from the T-DNA does not inactivate T-DNA transfer. T-DNA, disarmed in this way, is now the preferred 
vector for the genetic modification of plants. 

[0003] Although much is known about the transformation process, not much is known about the process by which 
the T-DNA is integrated into the plant genome. It is likely that plant enzymes mediate this step of the transformation 

20 process (Bundock et al. 1995). The integration pattern of T-DNA in transformed plants has been extensively studied 
(Matsumoto et al. 1 990, Gheysen et al. 1 991 , Mayerhofer et al. 1 991 ). The results indicated that T-DNA integrates via 
illegitimate recombination (IR) (also called non-homologous recombination, both terms may be used interchangeable 
herein), a process which can join two DNA molecules that share little or no homology (here the T-DNA and plant target 
DNA). Even T-DNA molecules in which a large segment of homologous plant DNA was present, integrated mainly by 

25 |R and only with very low frequency (1 ilOMO 5 ) by homologous recombination (HR) (Offringa et al. 1990). 

[0004] Recently, it was shown that Agrobacterium is not only able to transfer its T-DNA to plant cells, but also to 
other eukaryotes, including the yeast S, cerevisiae (Bundock et al. 1 995) and a wide variety of filamentous fungi (deGroot 
et al. 1 998). In S.cerevisiae, T-DNA carrying homology with the yeast genome integrates via HR (Bundock et al. 1 995). 
However, T-DNA lacking any homology with the S.cerevisiae genome becomes integrated at random positions in the 

30 genome by the same IR process as is used in plants (Bundock and Hooykaas 1 996). Apparently, eukaryotic cells have 
at least two separate pathways (one via homologous and one via non-homologous recombination) through which 
nucleic acids (in particular of course DNA), can be integrated into the host genome. The site of integration into a host 
cell genome is important with respect to the likelihood of transcription and/or expression of the integrated nucleic acid. 
The present invention provides methods and means to direct nucleic acid integration to a predetermined site through 

35 steering integration towards the homologous recombination pathway. The present invention arrives at such steering 
either by enhancing the HR pathway, or by inhibiting (meaning reducing) the IR pathway. 

[0005] Host factors involved in the integration of nucleic acid by IR have so far not been identified. The present 
invention discloses such factors which enables the design of methods for their (temporary) inhibition, so that integration 
of nucleic acid by IR is prevented, shifting the integration process towards HR and facilitating the isolation of a host 
40 cell with nucleic acid integrated by HR at a predetermined site. This is extremely important, since there is no method 
available yet for easy and precise genetic modification of a host cell using HR (gene targeting). Of course the actual 
site of integration is then determined by homology of the nucleic acid of interest with said site. 

[0006] In a first embodiment the invention provides a method to direct nucleic acid integration of a nucleic acid of 
interest to a pre-determined site, whereby said nucleic acid has homology at or around the said pre-determined site, 

45 in an eukaryote with a preference for non-homologous recombination comprising steering an integration pathway to- 
wards homologous recombination. Integration is a complex process wherein a nucleic acid sequence becomes part 
of the genetic material of a host cell. One step in the process of nucleic acid integration is recombination; via recom- 
bination nucleic acid sequences are exchanged and the introduced nucleic acid becomes part of the genetic material 
of a host cell. In principle two different ways of recombination are possible: homologous and illegitimate or non-homol- 

50 ogous recombination. Most (higher) eukaryotes do not or at least not significantly practise homologous recombination 
although the essential proteins to accomplish such a process are available. One reason for this phenomenon is that 
frequent use of homologous recombination in (higher) eukaryotes could lead to undesirable chromosomal rearrange- 
ments due to the presence of repetitive nucleic acid sequences. To accomplish homologous recombination via a method 
according to the invention, it is important to provide a nucleic acid which has homology with a pre-determined site. It 

55 is clear to a person skilled in the art that the percentage of homology and the length of (a) homologous region(s) play 
an important role in the process of homologous recombination. The percentage of homology is preferably close to 
100%. A person skilled in the art is aware of the fact that lower percentage of homology are also used in the field of 
homologous recombination, but dependent on, for example, the regions of homology and their overall distribution, lead 
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to a lower efficiency of homologous recombination but are still useful and therefore included in the present invention. 
Furthermore, the length of a (nearly) homologous region is approximately 3 kb which is sufficient to direct homologous 
recombination. At least one homologous region is necessary for recombination but more preferably 2 homologous 
regions flanking the nucleic acid of interest are used for targeted integration. The researcher skilled in the art knows 
how to select the proper percentage of homology, the length of homology and the amount of homologous regions. By 
providing such a homology a nucleic acid is integrated at every desired position within the genetic material of a host 
cell. It is clear to a person skilled in the art that the invention as disclosed herein is used to direct any nucleic acid 
(preferably DNA) to any pre-determined site as long as the length of homology and percentage of homology are high 
enough to provide homologous recombination. A pre-determined site is herein defined as a site within the genetic 
material contained by a host cell to which a nucleic acid with homology to this same site is integrated with a method 
according to the invention. It was not until the present invention that a nucleic acid is integrated at every desired position 
and therefore a method according to the invention is applied, for example, to affect the gene function in various ways, 
not only for complete inactivation but also to mediate changes in the expression level or in the regulation of expression, 
changes in protein activity or the subcellular targeting of an encoded protein. Complete inactivation, which can usually 
not be accomplished by existing methods such as antisense technology or RNAi technology (Zrenner et al, 1993), is 
useful for instance for the inactivation of genes controlling undesired side branches of metabolic pathways, for instance 
to increase the quality of bulk products such as starch, or to increase the production of specific secondary metabolites 
or to inhibit formation of unwanted metabolites. Also to inactivate genes controlling senescence in fruits and flowers 
or that determine flower pigments. Replacement of existing regulatory sequences by alternative regulatory sequences 
is used to alter expression of in situ modified genes to meet requirements, (e.g. expression in response to particular 
physical conditions such as light, drought or pathogen infection, or in response to chemical inducers), or depending 
on the developmental state (e.g. in a storage organ, or in fruits or seeds) or on tissue or cell types. Also a method 
according to the invention is used to effectuate predictable expression of transgenes encoding novel products, for 
example by replacing existing coding sequences of genes giving a desired expression profile by those for a desired 
novel product. For example to produce proteins of medicinal or industrial value in the seeds of plants the coding se- 
quence of a strongly expressed storage protein may be replaced by that of the desired protein. As another example 
existing coding sequences are modified so that the protein encoded has optimized characteristics for instance to make 
a plant herbicide tolerant, to produce storage proteins with enhanced nutritional value, or to target a protein of interest 
to an organelle or to secrete it to the extracellular space. As yet another example eukaryotic cells (including yeast, 
fungi, plant and mammalian cells) are provided with a gene encoding a protein of interest integrated into the genome * 
at a site which ensures high expression levels. As another example the nucleic acid of interest can be part of a gene 
delivery vehicle to deliver a gene of interest to a eukaryotic cell in vitro or in vivo. In this way a defect p53 can be 
replaced by an intact p53. In this way a tumoricidal gene can be delivered to a pre-determined site present only in e. 
g. proliferating cells, or present only in tumor cells, e.g. to the site where a tumor antigen is expressed from. Gene 
delivery vehicles are well known in the art and include adenoviral vehicles, retroviral vehicles, non-viral vehicles such 
as liposomes, etc. As another example the invention is used to produce transgenic organisms. Knock-out transgenics 
are already produced by homologous recombination methods. The present invention improves the efficiency of such 
methods. Also transgenics with desired properties are made. 

[0007] In another embodiment the invention provides a method to direct nucleic acid integration to a pre-determined 
site, whereby said nucleic acid has homology at or around the said pre-determined site, in an eukaryote with a pref- 
erence for non-homologous recombination comprising steering an integration pathway towards homologous recombi- 
nation by providing a mutant of a component involved in non-homologous recombination. Methods to identify compo- 
nents involved in non-homologous recombination are outlined in the present description wherein S.cerevisiae was 
used as a model system. To this end several yeast derivatives defective for genes known to be involved in various 
recombination processes were constructed and the effect of the mutations on T-DNA integration by either HR or IR 
was tested. The results as disclosed herein show that the proteins encoded by YKU70, RAD50, MRE11, XRS2, LIG4 
and SIR4 play an essential role in DNA integration by IR but not by HR. It is clear to a person skilled in the art that 
different mutants of a component involved in non-homologous recombination exist. Examples are deletion mutants, 
knock-out (for example via insertion) mutants or point mutants. Irrespective of the kind of mutant it is important that a 
component involved in non-homologous recombination is no longer capable or at least significantly less capable to 
perform it's function in the process of non-homologous recombination. As disclosed herein disruption of YKU70, RAD50, 
MRE11, XRS2, LIG4 and SfR4 did not affect the frequency of DNA integration by HR, showing that these genes are 
not involved in DNA integration by HR, but only in DNA integration by IR. In another embodiment the invention provides 
a method to direct integration of a nucleic acid of interest to a subtelomeric and/or telomeric region in an eukaryote 
with a preference for non-homologous recombination by providing a mutant of a component involved in non-homolo- 
gous recombination. A telomeric region is defined herein as a region containing repetitive sequences which is located 
at the end of a chromosome. Sub-telomeric region is herein defined as a region flanking the telomeric region. As an 
example is disclosed herein that in yeast strains carrying disruptions of RAD50, MRE11 or XRS2 the distribution of 
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integrated DNA copies is altered when compared to wildtype. DNA becomes preferentially integrated in telomeres or 
subtelomeric regions in the radSO, mre 1 1 and xrs2 mutants. A g reat advantage of integration of DNA copies in telomeres 
or subtelomeric regions instead of integration elsewhere in the genomic material is that there is no danger for host 
genes being mutated or inactivated by a DNA insertion. When in plants deficient for RAD50, MRE 11 or XRS2 DNA 
5 copies also integrate into telomeres or telomeric regions, such plants are used for telomeric targeting of T-DNA in 
transformation experiments to prevent additional insertion mutations from random T-DNA integration into the plant 
genome. 

[0008] In yet another embodiment the invention provides a method to direct nucleic acid integration to a pre-deter- 
mined site, whereby said nucleic acid has homology at or around the said pre-determined site, in an eukaryote with a 
10 preference for non-homologous recombination comprising steering an integration pathway towards homologous re- 
combination by partially or more preferably completely inhibiting a component involved in non-homologous recombi- 
nation. Partial or complete inhibition of a component involved in non-homologous recombination is obtained by different 
methods, for example by an antibody directed against such a component or a chemical inhibitor or a protein inhibitor 
or peptide inhibitor or an antisense molecule or an RNAi molecule. Irrespective of the kind of (partial or more preferably 
complete) inhibition it is important that a component involved in non-homologous recombination is no longer capable 
or at least significantly less capable to perform it's function in the process of non-homologous recombination. In yet 
another embodiment the invention provides a method to direct integration of a nucleic acid of interest to a sub-telomeric 
and/or telomeric region in an eukaryote with a preference for non-homologous recombination by partially or more 
preferably completely inhibiting a component involved in non-homologous recombination. 
20 [0009] In a preferred embodiment the invention provides a method to direct nucleic acid integration to a pre-deter- 
mined site or to a sub-telomeric and/or telomeric region by providing a mutant of a component involved in non-homol- 
ogous recombination or by partially or more preferably completely inhibiting a component involved in non-homologous 
recombination wherein said component comprises ku70, rad50, mrel 7, xrs2, lig4ox sir4. Components involved in non- 
homologous recombination are identified as outlined in the present description. The nomenclature for genes as used 
25 above is specific for yeast. Because the nomenclature of genes differs between organisms a functional equivalent or 
a functional homologue (see for example figure 2 to 5) and/or a functional fragment thereof, all defined herein as being 
capable of performing (in function, not in amount) at least one function of the yeast genes ku70, radSO, mrel 7, xrs2 y 
Hg4 or sir4 are also included in the present invention. A mutant of a component directly associating with a component 
involved in non-homologous recombination or (partial or complete) inhibition of a component directly associating with 
30 a component involved in non-homologous recombination is also part of this invention. Such a component directly 
associating with a component involved in non-homologous recombination is, for example, identified in a yeast two 
hybrid screening. An example of a component directly associating with a component involved in non-homologous 
recombination is KU80, which forms a complex with KU70. In a more preferred embodiment the invention provides a 
method to direct nucleic acid integration in yeast, fungus, plant or animal. 
35 [0010] In another embodiment the invention provides a method to direct nucleic acid integration to a pre-determined 
site, whereby said nucleic acid has homology at or around the said pre-determined site, in an eukaryote with a pref- 
erence for non-homologous recombination comprising steering an integration pathway towards homologous recombi- 
nation by transiently (partially or more preferably completely) inhibiting integration via non-homologous recombination. 
In yet another embodiment the invention provides a method to direct integration of a nucleic acid of interest to a sub- 
40 telomeric and/or telomeric region in an eukaryote with a preference for non-homologous recombination by transiently 
(partially or more preferably completely) inhibiting integration via non-homologous recombination. In a more preferred 
embodiment such a method is used for yeast, plant or fungus and the transient (partial or more preferable complete) 
inhibition is provided by an Agrobacterium Vir-fusion protein capable of (partially or more preferably completely) inhib- 
iting a component involved in non-homologous recombination or capable of (partially or more preferably completely) 
inhibiting a functional equivalent or homologue thereof or capable of (partially or more preferably completely) inhibiting 
a component directly associating with a component involved in non-homologous recombination. In a even more pre- 
ferred embodiment such a Agrobacterium Vir fusion protein comprises VirF or VirE2. It was shown that the Agrobac- 
terium VirF and VirE2 proteins are directly transferred from Agrobacterium to plant cells during plant transformation 
(Vergunst et al. 2000). To, for example, accomplish T-DNA integration by HR in plants, VirF fusion proteins containing 
for example a peptide inhibitor of IR in plant cells are introduced concomitantly with the targeting T-DNA. It has been 
reported that the C-terminal part (approximately 40 amino acids) of VirF or VirE2 is sufficient to accomplish transfer of 
T-DNA. A functional fragment and/or a functional equivalent of VirF or VirE is therefore also included in the present 
invention. In an even more preferred embodiment a component involved in non-homologous recombination comprises 
ku70, radSO, mrel 7, xrs2, Iig4 or sir4 or functional equivalents or homologous thereof or associating components. The 
55 nomenclature for genes as used above is specific for yeast. Because the nomenclature of genes differs between 
organisms a functional equivalent or a functional homologue (see for example figure 2 to 5) and/or a functional fragment 
thereof, all defined herein as being capable of performing (in function, not in amount) at least one function of the yeast 
genes ku70 t rad50, mre 7 7, xrs2 y iig4 or sir4 are also included in the present invention. By transiently (partially or more 
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preferably completely) inhibiting a component involved in non-homologous recombination a nucleic acid is integrated 
at any position without permanently modifying a component involved in non-homologous recombination and preventing 
unwanted side effects caused by the permanent presence of such a modified component involved in non-homologous 
recombination. 

[0011] Methods according to the present invention, as extensively discussed above, are used in a wide variety of 
applications. One embodiment of the present invention is the replacement of an active gene by an inactive gene ac- 
cording to a method of the invention. Complete inactivation, which can usually not be accomplished by existing methods 
such as antisense technology or RNAi technology, is useful for instance for the inactivation of genes controlling unde- 
sired side branches of metabolic pathways, for instance to increase the quality of bulk products such as starch, or to 
increase the production of specific secondary metabolites or to inhibit formation of unwanted metabolites. Also to 
inactivate genes controlling senescence in fruits and flowers or that determine flower pigments. Another embodiment 
of the present invention is the replacement of an inactive gene by an active gene. One example is the replacement of 
an defect p53 by an intact p53. Many tumors acquire a mutation in p53 during their development which renders it 
inactive and often correlates with a poor response to cancer therapy. By replacing the defect p53 by an intact p53, for 
example via gene therapy, conventional anti cancer therapy have better changes of succeeding. In yet another em- 
bodiment of the invention a therapeutic proteinaceous substance is integrated via a method of the invention. In this 
way a tumoricidal gene can be delivered to a p re-determined site present only in e.g. proliferating cells, or present only 
in tumor cells, e.g. to the site where a tumor antigen is expressed from. In yet another embodiment the invention 
provides a method to introduce a substance conferring resistance for an antibiotic substance to a cell according to a 
method of the invention. Also a method according to the invention is used to confer a desired property to an eukaryotic 
cell. In an preferred embodiment method a gene delivery vehicle is used to deliver a desired nucleic acid to a pre- 
determined site. Gene delivery vehicles are well known in the art and include adenoviral vehicles, retroviral vehicles, 
non-viral vehicles such as liposomes, etc.. In this way, a for example, tumoricidal gene can be delivered to a pre- 
determined site present only in e.g. proliferating ceils, or present only in tumor cells, e.g. to the site where a tumor 
antigen is expressed from. 

[0012] Furthermore a method according to the invention is used to improve gene targeting efficiency. Such a method 
is used to improve for example the gene targeting efficiency in plants. In plants transgenes integrate randomly into the 
genome by IR (Mayerhofer et al. 1991, Gheysen et al. 1991). The efficiency of integration by HR is very low, even 
when large stretches of homology between the transgene and the genomic target site are present (Offringa et al. 1 990)1 
Therefore, the efficiency of gene targeting using HR is very low in plants. The results that are disclosed herein shoW 
how to improve the gene targeting efficiency in plants. From the fact that T-DNA integration by IR is strongly reduced 
in KU70, RAD50, MRE 1 /, XRS2, LIG4 and SIR4 deficient yeast strains and T-DNA integration by HR is not affected 
in such strains, we infer that also in plants, deficient for either of these genes, T-DNA integration by HR is more easily 
obtained. Recently, we have cloned a KU70 homologue of Arabidopsis tha/iana (see figure 2, Bundock 2000, unpub- 
lished data). RAD50, MRE11 and UG4 homologues have already been found in A.thaiiana (GenBank accession num- 
bers AF1 68748, AJ243822 and AF233527, respectively, see also figure 3, 4 and 5 (Hartung and Puchta 1999). Cur- 
rently, screenings are being performed to find plants carrying a T-DNA inserted in AtMREI r, AtKU70or AtLIG4. These 
knockout plants are used to test whether T-DNA integration by IR is reduced and integration by HR is unaffected, 
thereby facilitating the detection of T-DNA integration by HR. 

[0013] The invention will be explained in more detail in the following description, which is not limiting the invention. 
EXPERIMENTAL PART 
Yeast strains. 

[0014] The yeast strains that were used are listed in Table 1 . Yeast mutants isogenic to the haploid YPH250 strain 
were constructed using the one-step disruption method (Rothstein 1991). A 1987 bp fragment from the YKU70 locus 
was amplified by PCR using the primers hdflpl S'-GGGATTGCTTTAAGGTAG-S' and hdf1p2 5'-CAAATACCCTAC- 
CCTACC-3'. The PCR product was cloned into pT7Blue (Novagen) to obtain pT7Blue YKU70. A 1177 bp EcoRV/Hin6\\\ 
fragment from the YKU70OFLF was replaced by a 2033 bp H/ndlll/Smal LEU2 containing fragment from pJJ283 (Jones 
and Prakash 1990), to form pT7Blue YKU70::LEU2. In order to obtain YKU70 disruptants Leu + colonies were selected 
after transformation of YPH250 with a 2884 bp A/del/Smotl fragment from pT7Blue YKU70::LEU2. The Expand™ High 
Fidelity System (Boehringer Mannheim) was used according to the supplied protocol to amplify a 3285 bp fragment 
from the LIG4 locus with primers 



dnl4pl 5-CGTAAGATTCGCCGAGTATAG-3' 



and 
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dnl4p2 5 , -CGTTTCAAATGGGACCACAGC-3\ 

The PCR product was cloned into pGEMT (Promega), resulting in pGEMT/_/G4. A 1 326 bp BamH\/Xho\ fragment from 
pJJ215 (Jones and Prakash 1990) containing the HIS3 gene was inserted into the BamH\ and Xho\ sites of plC20R, 
resulting in plC20RH/S3. A 782 bp EcoRI fragment from the UG4 ORF was replaced with a 1367 bp EcoRI HIS3 
containing fragment from plC20RH/S3to construct pGEMTL/G4;;H/S3. In order to obtain LIG4 disruptants His+ colonies 
were selected after transformation of YPH250 with a 3854 bp NcoVNotl fragment from pGEMT LIG4::HIS3. In order to 
obtain RAD50 disruptants YPH250 was transformed with a EcoRVBg/U fragment from pNKY83 and Ura + colonies were 
selected (Alani et al. 1989). A rad50::hisG strain was obtained by selecting Ura- colonies on selective medium con- 
taining 5-FOA. Similarly RAD51 disruptants were obtained were obtained after transformation of YPH250 with a 
RAD51::LEU2Xba\/Pst\ fragment from pDG1 52 and selection of Leu+ colonies (Schiestl et al. 1 994). The TRP1 marker 
in pSM21 (Schild et al. 1983) was replaced with a Bgl\\/Xba\ LEU2 containing fragment from pJJ283 (Jones and Pra- 
kash, 1990). This resulted in pSM21 LEU2. Leu+ RAD52 disruptant colonies were selected after transformation of 
YPH250 with the RAD52::LEU2 BamH/fragment from pSM21 LEU2. Disruption constructs were transformed to YPH250 
by the lithium acetate transformation method as described (Gietz et al. 1992). Disruption of YKU70 t UG4, RAD50, 
RAD51 and RAD52 was confirmed by PCR and Southern blot analysis. 



Table 1 : 



Yeast strains 


Strain 


Genotype 


Reference 


YPH250 


MATa, ura3-52, Iys2-801, ade2-101 t trp1-A1, his3-A200, Ieu2-A1 


Sikorski and Hieter 1 989 


YPH250rad57 


MATa, ura3-52, Iys2-8Q1, ade2-101, trp1-A1, his3-A200, Ieu2-A1, 


This study 




rad51::LEU2 


YPH250rarf52 


MATa, ura3-52, Iys2-801, ade2-101, trp1-A1, his3-A200, Ieu2-A1, 


This study 




rad52::LEU2 


YPH250y/cu70 


MATa, ura3-52, Iys2-801, ade2-101, trp1-A1, NS3-A200, Ieu2-A1, 


This study 




yku70::LEU2 


YPH250rad50 


MATvl, ura3-52, Iys2~801, ade2-101, trp1-A1 t his3-A200, Ieu2-A1 t 


This study 




rad50::hisG 


YPH250//g4 


MATa, ura3-52, Iys2-801, ade2-101, trp1-A1, his3-A200, Ieu2-A1, 


This study 




fig4::HIS3 


JKM115 


Aho, AhmlrADEI, MATa, Ahmr::ADE1, adel, leu2-3, 1 12, Iys5, trp1:: 
hisG, ura3-52 


Moore and Haber 1996 


JKM129 


Aho, Ahml::ADEh MATa, Ahmr::ADE1 , adel, \eu2-3, 1 12, /ys5, trp1:: 
hisG, ura3-52 y xrs2::LEU2 


Moore and Haber 1996 


JKM138 


Aho, Ahml::ADE1, MATa, Ahmrr.ADE 1, ade1> leu2-3, 1 12, Iys5, trp1:: 
hisG, ura3-52, mre11::hisG 


Moore and Haber 1996 


YSL204 


Aho, HMLa, MATa, HMRa, ade1-100, Ieu2-3,112, Iys5, trp1::hisG, 
ura3-52, hisG'-URA3-hisG\ sir4::HIS3 


Lee et al. 1999 



Construction of binary vectors. 



[0015] To construct pSDM8000 a 1513 bp Pvull/EcoRV fragment carrying the KanMX marker was obtained from 
pFA6a (Wach et al. 1994) and was iigated into the unique Hpals'We of pSDM14 (Offringa 1992). pSDM8001 was made 
in three cloning steps. A 1476 bp SamHI/EcoRI fragment carrying the KanMX marker was obtained from pFA6a and 
Iigated into BamHI and EcoRI digested plC20H to form plC20H/ca/7MX. The KanMX marker was inserted between the 
PDA1 flanks by replacement of a 2610 bp BglU fragment from pUC4E1a10 (Steensma et al. 1990) with a 1465 BglH 
fragment from plC20HfcanMX. A 3721 bp XhoUKpnl fragment from this construct was inserted into the Xho\ and Kpn\ 
sites of pSDM14. The binary vectors pSDM8000 and pSDM8001 were introduced into Agrobactehum tumefaciens 
LBA1119 by electroporation (den Dulk-Ras and Hooykaas 1995). 
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Cocultivatlons /T-DNA transfer experiments. 

[0016] Cocuftivations were performed as described earlier with slight modifications (Bundock et al. 1995). Agrobac- 
terium was grown overnight in LC medium. The mix of Agrobacterium and S. cerevisiae cells was incubated for 9 days 
5 at 20°C. G41 8 resistant S.cerevisiae strains were selected at 30°C on YPAD medium containing geneticin (200 n,g/ml) 
(Life Technologies/Gibco BRL). 

Vectorette PCR. 

io [0017] Chromosomal DNA was isolated using Qiagen's Genomic Tips G/20 per manufacters protocol. 1-2 ng of 
Genomic DNA was digested with EcoRI, CAxI, Psfl or W/ndlll and run on a 1% TBE-gel. Non-radioactive Southern 
blotting was performed. The membrane was hybridized with a digoxigenine-labeled kanMX probe to determine the 
size of T-DNA/genomic DNA fragments (EcoRl and C/a! for RB containing fragments and Psfl and H/ndlll for LB con- 
taining fragments). The kanMX probe, a 792 bp internal fragment of the KanMX marker, was made by PCR using 

15 primers kanmxpl 5'-AGACTCACGTTTCGAGGCC-3' and kanmxp2 5'-TCACCGAGGCAGTTCCATAG-3' and a Non- 
Radioactive DNA Labeling and Detection kit (Boehringer Mannheim). The enzyme showing the smallest band on blot 
was used for Vectorette PCR, in order to amplify the smallest junction sequence of T-DNA and genomic DNA. Vectorette 
PCR was performed as described 

(http://genomewww.stanford.edu/group/botlab/protocolsA/ectorette.html). The Expand™ High Fidelity System (Boe- 
20 hringer Mannheim) was used to amplify fragments larger than 2.5 kb, whereas sTaq DNA polymerase (SphaeroQ) was 
used for amplification of fragments smaller than 2.5 kb. Primer kanmxp3 5'-TCGCAGGTCTGCAGCGAGGAGC-3* and 
kanmxp4 ^-TCGCCTCGACATCATCTGCCCAG-S' were used to amplify RB/genomic DNA and LB/genomic DNA junc- 
tion sequences, respectively. 

25 T7 DNA Polymerase sequencing. 

[0018] Vectorette PCR products were cloned in pGEMTEasy (Promega) and sequenced using the T7 polymerase 
sequencing kit (Pharmacia) according to manufacturers protocol. In order to obtain sequences flanking the RB and 
LB, primers kanmxp5 5'-TCACATCATGCCCCTGAGCTGC-3' and kanmxp4 were used, respectively. 

30 

RESULTS 

1. Binary vectors for T-DNA transfer to yeast. 

35 [001 9] It was previously demonstrated that Agrobacterium tumefaciens is able to transfer its T-DNA not only to plants 
but also to another eukaryote, namely the yeast Saccharomyces cerevisiae (Bundock et al. 1995). T-DNA carrying 
homology with the yeast genome was shown to become integrated by homologous recombination. T-DNA lacking any 
homology with the yeast genome was integrated randomly into the genome by IR like in plants (Bundock et al. 1 995, 
Bundock and Hooykaas 1 996). The T-DNA used in these experiments carried the S.cerevisiae URA3gene for selection 

40 of Ura+ colonies after T-DNA transfer to the haploid yeast strain RSY12(URA3A). However, in this system only yeast 
strains could be used in which the URA3 gene had been deleted to avoid homology between the incoming T-DNA and 
the S.cerevisiae genome. 

[0020] We wanted to setup a system in which T-DNA transfer to any yeast strain could be studied. Therefore, two 
new binary vectors were constructed using the dominant marker kanMX (Wach et al. 1994), which confers resistance 

45 against geneticin (G41 8). The T-DNA of pSDM8000 carries only the KanMX marker. Since this KanMX marker consists 
of heterologous DNA, tacking any homology with the S.cerevisiae genome, we would expect this T-DNA to integrate 
by IR at a random position in the yeast genome. To be able to compare this with T-DNA integration by homologous 
recombination pSDM8001 was constructed. The T-DNA of pSDM8001 carries the KanMX marker flanked by sequences 
from the S.cerevisiae PDA 1 locus. The PDA 1 sequences have been shown to mediate the integration of T-DNA by HR 

50 at the PDA1 iocus on chromosome V (Bundock et al. 1995). 

[0021] Cocultivations between Agrobacterium strains carrying pSDM8000 and pSDM8001 , respectively, and the hap- 
loid yeast strains YPH250 and JKM115, respectively, were carried out as described in the experimental part. G418 
resistant colonies were obtained at low frequencies for YPH250 (1.6 x 10 7 ) and JKM115 (1.2 x 10 5 ) after T-DNA 
transfer from pSDM8000 (Table 2). T-DNA transfer from pSDM8001 generated G418 resistant colonies at higher fre- 

55 quencies (2.4 x 10 5 for YPH250 and 1.8 x 10* 4 JKM115, Table 2). The ratio of homologous recombination versus 
illegitimate recombination is determined by comparing the frequencies of G418 resistant colonies obtained from coc- 
ultivations using either pSDM8001 orpSDM8000. This showed that a T-DNA from pSDM8001 was 150-fold more likely 
to integrate than a T-DNA from pSDM8000 in YPH250 (Table 2). A similar difference was previously seen using T-DNAs 
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with the URA3 marker (Bundock and Hooykaas 1996). In contrast, T-DNA from pSDM8001 was only 16-fold more 
likely to integrate than a T-DNA from pSDMSOOO in JKM115. There was no significant difference in the frequency of 
T-DNA transfer to these two yeast strains as was determined by T-DNA transfer experiments in which a T-DNA, that 
carried the KanMXmarker and the yeast 2 micron replicon, was employed. Therefore, the differences in the frequencies 
of T-DNA integration by HR and IR between the yeast strains YPH250 and JKM115, respectively, is most likely con- 
tributed to differences in the capacities of their HR and IR recombination machineries. 

[0022] We confirmed by PCR that the T-DNA from pSDM8001 became integrated at the PDA1 locus by homologous 
recombination (data not shown). In order to find out whether the T-DNA from pSDM8000 had integrated randomly by 
IR yeast target sites for integration were determined from 8 G418 resistant YPH250 colonies by Vectorette PCR (for 
detailed description see materials and methods). Chromosomal DNA was isolated and digested with a restriction en- 
zyme that cuts within the T-DNA. A Vectorette was ligated to the digested DNA and a PCR was performed using a 
T-DNA specific and a Vectorette specific primer. The PCR product obtained was cloned into pGEMTEasy and se- 
quenced using a T-DNA specific primer. The position of the T-DNA insertion was determined by basic BLAST search 
of the yeast genome (http://www-genome.stanford.edu/SGD). We were thus able to map the position of the T-DNA 
insertions of all 8 G418 resistant colonies analyzed. They were present at different positions spread out over the ge- 
nome. Comparison of the T-DNA sequence and yeast target site sequences did not reveal any obvious homology. 
These data show that the T-DNA from pSDMSOOO had integrated via an IR mechanism as expected. 
[0023] The following characteristics have previously been observed for T-DNAs integrated by IR: a) the 3' end of the 
T-DNA is usually less conserved compared to the 5' end, b) microhomology is sometimes present between the T-DNA 
ends and the target site, c) integration is often accompanied by small deletions of the target site DNA (Matsumoto et 
al. 1990, Gheysen et al. 1991, Mayerhofer et al. 1991, Bundock and Hooykaas 1996). Similar characteristics were 
seen in the currently analyzed 8 T-DNA insertions. In 3 strains we observed microhomology of 2 - 6 bp between the 
LB and yeast target site (figure 1, WT.51 was taken as an example), in 5 strains deletions of 1 - 5 bp of yeast target 
site DNA was found and we observed deletions, varying from 1 - 112 bp, of the 3' end of the T-DNA in 7 out of 8 
analyzed strains. In only 1 strain the 3' end appeared to be intact. The 5' end of the T-DNA was conserved in almost 
all strains. In only 2 strains we observed small deletions of 1 and 2 bp at the 5' end of the T-DNA. 
[0024] Thus, we can conclude that the T-DNA from pSDM8000 had integrated via the same IR mechanism described 
before. 



30 Table 2: 





frequencies of T-DNA integration by IR relative to integration by HR in recombination defective yeast strains 




Strain 


Genotype 


Freq. of IR a 


Freq. of HR 


Absolute IR/HR 


Standardized IR/ 












ratio b 


HR ratio c 


35 


YPH250 


WT 


1.6 x 10- 7 


2.4 x 10-5 


0.007 


1 




YPH250 rad51 


rad51A 


1.4 x 10" 7 


1.5 x 10' 6 


0.09 


14 




YPH250 rad52 


rad52A 


3.8 x 10" 7 


2.5 x 10- 6 


0.15 


23 




YPH250 yku70 


yku70A 


<3.2x 10" 9 


3.3 x 10" 5 


<0.0001 


<0.01 


40 


YPH250 rad50 


radSOA 


8.0 x 10-9 


3.5 x 10-5 


0.0002 


0.03 


YPH250 Iig4 


lig4A 


3.7 x 10" 9 


2.3 x 10-5 


0.0002 


0.02 




JKM115 


WT 


1.2 x 10 5 


1.8 x 10- 4 


0.07 


1 




JKM129 


xrs2A 


2.7 x 10 7 


5.1 x 10 5 


0.005 


0.08 


45 


JKM138 


mre11A 


2.9 x 10- 7 


7.5 x 10- 5 


0.004 


0.06 




YSL204 


sir4A 


1.5 x 10-7 


1.8 x 10-5 


0.008 


0.13 



a Averages of 2 or more independant experiments are shown. Frequencies are depicted as the number of G418 resistant colonies devided by the 
output number of yeast cells (cells/ml). 

b The frequency of T-DNA integration by IR (pSDM8000) devided by the frequency of T-DNA integration by HR (pSDM8001). 
c The ratio of IR/HR in the mutant strain devided by the ratio of IR/HR in the wildtype strain. 



2. Host-specific factors involved In random T-DNA integration. 

[0025] The observation that the T-DNA from pSDMSOOO integrates by IR into the yeast genome allowed us to use 
this system to study the effect of host factors on the process of integration. Many proteins involved in various forms of 
DNA recombination have been identified in yeast. In order to determine the roles of a representative set of these 
enzymes in T-DNA integration, we compared T-DNA transfer and integration in wildtype yeasts with that of strains 
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carrying disruptions of the genes encoding several recombination proteins. The RAD51, RAD52, KU70, RAD50 and 
LIG4 genes were deleted from YPH250 using the one step disruption method. Yeast strains carrying deletions in 
MRE11, XRS2 and SIR4 in the JKM1 15 background were kindly provided by Dr. J. Haber. The results of cocultivations 
with these yeast strains are shown in Table 2. 

[0026] In rad51 and rad52 mutants, which are impaired in homologous recombination, the frequency of T-DNA in- 
tegration by HR was 16- and 9-fold lower, respectively, than observed for the wildtype (Table 2). This showed that 
RAD51 and RAD52 play a role in T-DNA integration by homologous recombination. In the IR defective ku70, rad50, 
Iig4, rnre11, xrs2 and sir4 mutants the frequency of T-DNA integration by HR did not differ significantly from that ob- 
served for wildtype (Table 2). This showed that these genes do not play a role in T-DNA integration by homologous 
recombination. 

[0027] The frequency of T-DNA integration by IR in a rad51 mutant did not differ significantly from that observed for 
wildtype, whereas in a rad52 mutant the frequency was about 2-fold higher (Table 2). This showed that RAD51 and 
RAD52 are not involved in T-DNA integration by IR. The product of the RAD52 gene may compete with IR-enzymes 
for the T-DNA and thereby inhibits integration by IR to some extent. Strikingly, in radSO, mre11, xrs2, Iig4 and sir4 
mutants the frequency of T-DNA integration by IR was reduced dramatically: 20- to more than 40-fold (Table 2). T-DNA 
integration by IR seemed to be completely abolished in the ku70 mutant. We did not obtain any G41 8 resistant colonies 
from several cocultivation experiments. This strongly suggests that KU70 plays an important role in random T-DNA 
integration in yeast. 

[0028] Since T-DNA integration by HR is normal in these mutants, these results clearly show that the yeast genes 
KU70, RAD50, MRE11, XRS2, LIG4 and SIR4 are involved in T-DNA integration by illegitimate recombination. 

3. Chromosomal distribution of Integrated T-DNA copies In IR defective S.cerevlslae. 

[0029] From several cocultivation experiments with the rad50, mrel 7, xrs2, Iig4 and sir4 mutants we obtained a small 
number of G418 resistant colonies. The T-DNA structure was determined for a number of these lines. To this end 
chromosomal DNA was isolated from these G41 8 resistant colonies and subjected to vectorette PCR to amplify junction 
sequences of genomic DNA and T-DNA. PCR products were cloned and sequenced. The yeast sequences linked to 
the T-DNA were used in a BLAST search at http://www-genome.stanford.edu/SGD to determine the T-DNA integration 
sites. 

[0030] Strikingly, analysis of LB/genomic DNA junctions revealed that in 2 out of 3 rad50, 4 out of 6 mrel 1 and 2 
xrs2 strains analyzed, T-DNAs had integrated in telomeres or subtelomeric regions (rad50k.1, rad50k.6, mre11k.8, 
mre11k.11, mrellk.14, mre11k.17 t xrs2k.1 and xrs2k.17; Table 3 and figure 1). S. cerevisiae telomeres generally 
consist of one or more copies of the Y' element followed by telomerase-generated C(1-3)A/TG(1-3) repeats (Zakian 
1 996). In 2 rad50 strains, 2 mrel 1 strains and 1 xrs2 strain the LB was found to be fused to this typical telomerase- 
generated C(1 -3)A/TG(1 -3) repeat (rad50k. l/radSOk.6, mre 1 1k. 14, mrel 1k. 17 and xrs2k. 7; figure 1 ). Besides, we also 
found one T-DNA insertion in a Ty LTR element in the mrel 1 mutant and 2 insertions in the rDNA region, present in 
chromosome XII, in the mrel 1 and rad50 mutants (mrel 1k.5, mrel 1k.4 and rad50k.5, respectively; Table 3 and figure 
1). 

[0031] The 3' end of the T-DNA was truncated in all strains. Deletions of 3 - 11 bp of the 3'end of the T-DNA were 
observed (figure 1). Microhomology between the 3' end of the T-DNA and yeast target site was only found in 2 lines 
(5 bp in mre11k.4 and 4 bp in mrel 1k.14; figure 1). For the T-DNA copies present at the yeast telomeres, the RB/ 
genomic DNA junction sequences could not be obtained from these strains using vectorette PCR. This was only pos- 
sible for the radSO and mre11 strains carrying the T-DNA in the rDNA region on chromosome XII. In both strains the 
RB was intact and no homology between the 5* end of the T-DNA and the yeast target site was found (data not shown 
in figure 1). 

[0032] Previously, target sites for T-DNA integration in the genome of S.cerevisiae strain RSY12 were determined 
(Bundock and Hooykaas 1 996, Bundock 1 999). In 4 out of 44 strains analyzed, T-DNA copies were integrated, in rDNA, 
Ty LTR elements (in 2 strains) and a subtelomeric located Y' element, respectively. In addition, we determined the 
position of T-DNA integration in ten S.cerevisiae YPH250 strains. We did not find any T-DNA insertions in rDNA, LTR 
elements or subtelomeric/telomeric regions amongst these ten. Pooling all insertions analyzed in wildtype (54), in 2 
out of 54 strains analyzed (4%) insertions were found in a Ty LTR element and in two other strains in the rDNA repeat 
(2%) and a subtelomeric region (2%), respectively. In contrast, we report here that T-DNA in yeast strains mutated in 
RAD50, MRE11 or XRS2 T-DNA integrates preferentially in (sub)telomeric regions (8 out of 11 lines: -73%) of rad50, 
mrel 1 and xrs2 mutants (table 3). From the remaining strains two T-DNAs were present in rDNA and one in a Ty LTR 
element, respectively. Apparently, the rDNA repeat is also a preferred integration site in these mutants (-18% vs. -2% 
in the wildtype). 

[0033] Telomeres consist of a large array of telomerase-generated C(1-3)A/TG(1-3) repeats (-350 bp). In the sub- 
telomeric regions two common classes of Y' elements, 6.7 and 5.2 kb, can be found (in most strains chromosome I 
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does not contain V) (Zakian and Blanton 1988), making the average size of these regions -6,0 kb. Thus, the yeast 
genome contains (16 x 2 x 0.35) + (15 x 2 x 6,0) = 191 kb of subtelomeric/telomeric sequences. The yeast genome is 
12,052 kb in size, which means that only 1.6% of the genome consists of subtelomeric/telomeric sequences. In ac- 
cordance with this, we observed in only 2% of the wildtype strains T-DNA copies inserted in a subtelomeric region, 
which we would expect on the basis of random T-DNA integration. In contrast, in the rad50 t mre11 and xrs2 mutants 
73% of the T-DNA insertions were found in the (sub)telomeric region. 

[0034] Analysis of 7 lines revealed that in the sir4 mutant T-DNA was integrated randomly into the yeast genome. 
So, although SIR4 has an effect on the efficiency of T-DNA integration by IR, the pattern of T-DNA distribution in the 
transformants seems similar as in the wildtype strain. In the sir4 mutant T-DNA integration by IR was characterized by 
truncation of the 3' end of the T-DNA, deletions at the target site and microhomology between the LB and the target 
site (data not shown), like this was observed for T-DNA integration by IR in the wildtype. 

[0035] These results clearly show that in the rad50, mrel 1 and xrs2 mutants the T-DNA, if integrated at all, becomes 
preferentially inserted in telomeres or subtelomeric regions and that the genomic distribution of integrated T-DNAs is 
altered when compared to wildtype. However, disruption of SIR4 did affect the efficiency of T-DNA integration by IR, 
but not the characteristics of this process. 



Table 3: 



genomic distribution of T-DNA integrated by IR in rad50, mre11 and xrs2 mutants in comparison with the wildtype 


after T-DNA transfer from pSDM8000 








Yeast strain 


(Sub)Telomehc region 


LTR 


rDNA 


Elsewhere 


rad50 mutant 


2 


0 


1 


0 


mrel 1 mutant 


4 


1 


1 


0 


xrs2 mutant 


2 


0 


0 


0 


wildtype 


1 


2 


1 


50 



DESCRIPTION OF FIGURES 



30 [0036] Figure 1 : Junction sequences of T-DNA and S.cerevisae genomic DNA. S.cerevisiae YPH250 (WT), rad50, 
mre11 and xrs2 strains were cocultivated with LBA1119(pSDM8000). G418 resistant colonies were obtained. Chro- 
mosomal DNA was isolated and subjected to Vectorette PCR to determine the sequence of genomic DNA flanking the 
T-DNA. Position of T-DNA integration was determined by basic BLAST search of the yeast genome at http:/www. 
genome-stanford.edu/SGD. The Watson strand of genomic DNA that is fused to the LB or RB is shown in italics. Bold 

35 sequences represent sequence homology between the LB and target site. The filler DNA sequence is underlined and 
depicted in italics. The numbers above the LB sequences represents the number of bp deleted from the LB. Tel. = 
telomeric, Subtel. = subtelomeric and Int. = intergenic. 

[0037] Figure 2: Alignment of KU70 homologues. Sc = Saccharomyces cerevisiae, Hs = Homo sapiens and At = 
Arabidopsis thaiiana. Perfect identity is depicted as black boxes, homology is depicted as grey boxes and dashes were 
40 used to optimise alignment. 

[0038] Figure 3: Alignment of LIG4 homologues. Sc = Saccharomyces cerevisiae, Hs = Homo sapiens and At = 
Arabidopsis thaiiana. Perfect identity is depicted as black boxes, homology is depicted as grey boxes and dashes were 
used to optimise alignment. 

[0039] Figure 4: Alignment of MRE1 1 homologues. Sc = Saccharomyces cerevisiae, Hs = Homo sapiens and At = 
45 Arabidopsis thaiiana. Perfect identity is depicted as black boxes, homology is depicted as grey boxes and dashes were 
used to optimise alignment. 

[0040] Figure 5: Alignment of RAD50 homologues. Sc = Saccharomyces cerevisiae, Hs = Homo sapiens and At = 
Arabidopsis thaiiana. Perfect identity is depicted as black boxes, homology is depicted as grey boxes and dashes were 
used to optimise alignment. 

50 
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SEQUENCE LISTING 

<110> Universiteit Leiden 

Stichting Binair Vector Systeem 

<120> Nucleic acid integration in eukaryotes 

<130> P54997EP00 

<140> 00204693.6 
<141> 2000-12-22 

<160> 34 

<170> Patent In Ver. 2.1 

<210> 1 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer hdflpl 
<220> 

<221> misc_feature 
<222> (1) . . (18) 

<400> 1 

gggattgctt taaggtag 



<210> 2 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> (1) . . (18) 

<220> 

<223> Description of Artificial Sequence: primer hdflp2 
<400> 2 

caaataccct accctacc 



<210> 3 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer dnl4pl 
<220> 

<221> mis cofeature 
<222> (1) . - (21) 

<400> 3 

cgtaagattc gccgagtata g 



<210> 4 
<211> 21 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer dnl4p2 
<40O> 4 

cgtttcaaat gggaccacag c 



<210> 5 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer kanmxpl 
<220> 

<221> misc^feature 
<222> (1) . . (19) 

<400> 5 

agactcacgt ttcgaggcc 



<210> 6 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer kanmxp2 
<220> 

<221> misc_feature 
<222> (1) . . (20) 

<400> 6 

tcaccgaggc agttccatag 



<210> 7 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer kanmxp3 
<220> 

<221> misc_feature 
<222> (1) . . (22) 

<400> 7 

tcgcaggtct gcagcgagga gc 

<210> 8 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer kanmxp4 
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<220> 

<221> misc_f eature 
<222> (1)..(23) 

<400> 8 

tcgcctcgac atcatctgcc cag 



23 



10 



15 



20 



<210> 9 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
<220> 

<221> mi sc_f eature 
<222> (1) . . (22) 

<400> 9 

tcacatcatg cccctgagct gc 



primer kanmxpS 



22 



25 



30 



<210> 10 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: part of a 
fragment derived from a junction sequence 



PCR 



35 



<220> 

<221> misc_feature 
<222> (1) . . (31) 

<223> /note="Wherein N stands for any nucleotide sequence' 
<400> 10 

caggatatat tcaattgtaa atctcncgag g 



31 



40 



45 



50 



55 



<210> 11 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: part of a PCR 
fragment derived from a junction sequence 

<220> 

<221> misc_f eature 
<222> (1) . . (37) 

<223> /note="Wherein N stands for any nucleotide sequence' 
<400> 11 

attgtattat atattcaatt gtaaatctcn cgaggta 



<210> 12 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 



37 
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<223> Description of Artificial Sequence: part of a PCR 
fragment derived" from a junction sequence 

<220> 

<221> misc_feature 
<222> (1) . . (33) 

<223> /note="Wherein N stands for any nucleotide sequence" 
<400> 12 

tgtgggtgtg attcaattgt aaatctcncg agg 



<210> 13 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: part of a PCR 
fragment derived from a junction sequence 

<220> 

<221> misc_feature 
<222> (1) . . (35) 

<223> /note="Wherein N stands for any nucleotide sequence" 
<400> 13 

gggggcatca gtattcaatt gtaaatctcn cgagg 

<210> 14 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: part of a PCR 
fragment derived from a junction sequence 

<220> 

<221> misc_f eature 
<222> (1) . . (39) 

<400> 14 

gaggtagatg tgagagagtg tgtgtgggtg tgaagtcga 

<210> 15 
<211> 35 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: part of a PCR 
fragment derived from a junction sequence 

<220> 

<221> misc_feature 
<222> (1) . . (35) 

<223> /note="Wherein N stands for any nucleotide sequence" 
<400> 15 

tctggtagat atattcaatt gtaaatctcn cgagg 
<210> 16 
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<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: part of a PCR 
fragment derived from a junction sequence 

<220> 

<221> misc_feature 
<222> (1) . . (35) 

<223> /note="Wherein N stands for any nucleotide sequence" 
<400> 16 

cacatatttc tcattcaatt gtaaatctcn cgagg 

<210> 17 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: part of a PCR 
fragment derived from a junction sequence 

<220> 

<221> misc_feature 
<222> (1) . . (35) 

<223> /note="Wherein N stands for any nucleotide sequence" 
<400> 17 

cgactacttt atatccaatt gtaaatctcn cgagg 

<210> 18 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: part of a PCR 
fragment derived from a junction sequence 

<220> 

<221> misc_feature 
<222> (1) . . (35) 

<223> note="Wherein N stands for any nucleotide sequence" 
<400> 18 

gaagaaccca ttattcaatt gtaaatctcn cgagg 

<210> 19 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: part of a PCR 
fragment derived from a junction sequence 

<220> 

<221> misc_f eature 
<222> (1) . . (35) 

<223> note="Wherein N stands for any nucleotide sequence" 
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<400> 19 

tgggtgtggg ttattcaatt gtaaatctcn cgagg 



<210> 20 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: part of a PCR 
fragment derived from a junction sequence 

<220> 

<221> misc_feature 
<222> (1) . . (35) 

<223> note»"Wherein N stands for any nucleotide sequence" 
<400> 20 

tgggtgtggt gtgttcaatt gtaaatctcn cgagg 



<210> 21 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: part of a PCR 
fragment derived from a junction sequence 

<220> 

<221> misc_feature 
<222> (1) . . (35) 

<223> /note="Wherein N stands for any nucleotide sequence" 
<400> 21 

tgtgtgggtg tgggtcaatt gtaaatctcn cgagg 



<210> 22 
<211> 35 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: part of a PCR 
fragment derived from a junction sequence 

<220> 

<221> misc_feature 
<222> (1) . . (35) 

<223> /note="Wherein N stands for any nucleotide sequence" 
<400> 22 

cgtcaaggat atattcaatt gtaaatctcn cgagg 

<210> 23 
<211> 602 
<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> SITE 
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<222> (1) . . (602) 
<223> /note^'KU 70" 

<400> 23 

Met Arg Ser Val Thr Asn Ala Phe Gly Asn Ser Gly Glu Leu Asn Asp 
1 5 10 15 

Gin Val Asp Glu Thr Gly Tyr Arg Lys Phe Asp lie His Glu Gly He 
20 25 30 

Leu Phe Cys lie Glu Leu Ser Glu Thr Met Phe Lys Glu Ser Ser Asp 
35 40 45 

Leu Glu Tyr Lys Ser Pro Leu Leu Glu He Leu Glu Ser Leu Asp Glu 
50 55 60 

Leu Met Ser Gin Leu Val He Thr Arg Pro Gly Thr Ala He Gly Cys 
65 70 75 80 

Tyr Phe Tyr Tyr Cys Asn Arg Glu Asp Ala Lys Glu Gly He Tyr Glu 
85 90 95 

Leu Phe Pro Leu Arg Asp He Asn Ala Thr Phe Met Lys Lys Leu Asn 
100 105 110 

Asp Leu Leu Glu Asp Leu Ser Ser Gly Arg He Ser Leu Tyr Asp Tyr 
115 120 125 

Phe Met Phe Gin Gin Thr Gly Ser Glu Lys Gin Val Arg Leu Ser Val 
130 135 140 

Leu Phe Thr Phe Met Leu Asp Thr Phe Leu Glu Glu lie Pro Gly Gin 
145 150 155 160 

Lys Gin Leu Ser Asn Lys Arg Val Phe Leu Phe Thr Asp He Asp Lys 
165 170 175 

Pro Gin Glu Ala Gin Asp He Asp Glu Arg Ala Arg Leu Arg Arg Leu 
180 185 190 

Thr He Asp Leu Phe Asp Asn Lys Val Asn Phe Ala Thr Phe Phe He 
195 200 205 

Gly Tyr Ala Asp Lys Pro Phe Asp Asn Glu Phe Tyr Ser Asp He Leu 
210 215 220 

Gin Leu Gly Ser His Thr Asn Glu Asn Thr Gly Leu Asp Ser Glu Phe 
225 230 235 240 

Asp Gly Pro Ser Thr Lys Pro He Asp Ala Lys Tyr He Lys Ser Arg 
245 250 255 

He Leu Arg Lys Lys Glu Val Lys Arg He Met Phe Gin Cys Pro Leu 
260 265 270 

He Leu Asp Glu Lys Thr Asn Phe lie Val Gly Val Lys Gly Tyr Thr 
275 280 "* 285 

Met Tyr Thr His Glu Lys Ala Gly Val Arg Tyr Lys Leu Val Tyr Glu 
290 295 300 

His Glu Asp He Arg Gin Glu Ala Tyr Ser Lys Arg Lys Phe Leu Asn 
305 310 315 "* 320 

Pro He Thr Gly Glu Asp Val Thr Gly Lys Thr Val Lys Val Tyr Pro 
325 330 335 
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Tyr Gly Asp Leu Asp He Asn Leu Ser Asp Ser Gin Asp Gin He Val 
340 345 350 

Met Glu Ala Tyr Thr Gin Lys Asp Ala Phe Leu Lys He He Gly Phe 
355 360 365 

Arg Ser Ser Ser Lys Ser He His Tyr Phe Asn Asn He Asp Lys Ser 
370 375 380 

Ser Phe He Val Pro Asp Glu Ala Lys Tyr Glu Gly Ser He Arg Thr 
385 390 395 400 

Leu Ala Ser Leu Leu Lys He Leu Arg Lys Lys Asp Lys lie Ala lie 
405 " 410 " * " 415 

Leu Trp Gly Lys Leu Lys Ser Asn Ser His Pro Ser Leu Tyr Thr Leu 
420 425 430 

Ser Pro Ser Ser Val Lys Asp Tyr Asn Glu Gly Phe Tyr Leu Tyr Arg 
435 440 445 

Val Pro Phe Leu Asp Glu He Arg Lys Phe Pro Ser Leu Leu Ser Tyr 
450 455 460 

Asp Asp Gly Ser Glu His Lys Leu Asp Tyr Asp Asn Met Lys Lys Val 
465 470 475 480 

Thr Gin Ser lie Met Gly Tyr Phe Asn Leu Arg Asp Gly Tyr Asn Pro 
485 490 495 

Ser Asp Phe Lys Asn Pro Leu Leu Gin Lys His Tyr Lys Val Leu His 
500 505 " 510 

Asp Tyr Leu Leu Gin lie Glu Thr Thr Phe Asp Glu Asn Glu Thr Pro 
515 520 525 

Asn Thr Lys Lys Asp Arg Met Met Arg Glu Asp Asp Ser Leu Arg Lys 
530 535 540 

Leu Tyr Tyr He Arg Asn Lys lie Leu Glu Ser Glu Lys Ser Glu Asp 
545 550 555 560 

Pro lie lie Gin Arg Leu Asn Lys Tyr Val Lys lie Trp Asn Met Phe 
565 570 575 

Tyr Lys Lys Phe Asn Asp Asp Asn He Ser lie Lys Glu Glu Lys Lys 
580 585 590 

Pro Phe Asp Lys Lys Pro Lys Phe Asn lie 
595 600 



<210> 24 
<211> 609 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (1) . . (609) 

<223> /note="KU 70 homologue" 

<400> 24 

Met Ser Gly Trp Glu Ser Tyr Tyr Lys Thr Glu Gly Asp Glu Glu Ala 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



1 5 10 15 

Glu Glu Glu Gin Glu Glu Asn Leu Glu Ala Ser Gly Asp Tyr Lys Tyr 
20 25 30 

Ser Gly Arg Asp Ser Leu He Phe Leu Val Asp Ala Ser Lys Ala Met 
35 40 45 

Phe Glu Ser Gin Ser Glu Asp Glu Leu Thr Pro Phe Asp Met Ser He 
50 55 60 

Gin Cys He Gin Ser Val Tyr He Ser Lys He He Ser Ser Asp Arg 
65 70 75 80 

Asp Leu Leu Ala Val Val Phe Tyr Gly Thr Glu Lys Asp Lys Asn Ser 
85 90 95 

Val Asn Phe Lys Asn He Tyr Val Leu Gin Glu Leu Asp Asn Pro Gly 
100 105 110 

Ala Lys Arg lie Leu Glu Leu Asp Gin Phe Lys Gly Gin Gin Gly Gin 

115 120 125 

Lys Arg Phe Gin Asp Met Met Gly His Gly Ser Asp Tyr Ser Leu Ser 
130 135 140 

Glu Val Leu Trp Val Cys Ala Asn Leu Phe Ser Asp Val Gin Phe Lys 
145 150 155 * 160 

Met Ser His Lys Arg He Met Leu Phe Thr Asn Glu Asp Asn Pro His 
165 170 ~ 175 

Gly Asn Asp Ser Ala Lys Ala Ser Arg Ala Arg Thr Lys Ala Gly Asp 
180 185 190 

Leu Arg Asp Thr Gly He Phe Leu Asp Leu Met His Leu Lys Lys Pro 
195 200 205 

Gly Gly Phe Asp He Ser Leu Phe Tyr Arg Asp He He Ser He Ala 
210 215 ' 220 

Glu Asp Glu Asp Leu Arg Val His Phe Glu Glu Ser Ser Lys Leu Glu 

225 230 . 235 "* 240 

Asp Leu Leu Arg Lys Val Arg Ala Lys Glu Thr Arg Lys Arg Ala Leu 
245 250 255 

Ser Arg Leu Lys Leu Lys Leu Asn Lys Asp He Val He Ser Val Gly 
260 265 270 

He Tyr Asn Leu Val Gin Lys Ala Leu Lys Pro Pro Pro lie Lys Leu 
275 280 " 285 

Tyr Arg Glu Thr Asn Glu Pro Val Lys Thr Lys Thr Arg Thr Phe Asn 
290 295 300 

Thr Ser Thr Gly Gly Leu Leu Leu Pro Ser Asp Thr Lys Arg Ser Gin 
305 310 315 320 

He Tyr Gly Ser Arg Gin He He Leu Glu Lys Glu Glu Thr Glu Glu 
325 330 335 

Leu Lys Arg Phe Asp Asp Pro Gly Leu Met Leu Met Gly Phe Lys Pro 
340 345 350 

Leu Val Leu Leu Lys Lys His His Tyr Leu Arg Pro Ser Leu Phe Val 
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355 360 365 

Tyr Pro Glu Glu Ser Leu Val lie Gly Ser Ser Thr Leu Phe Ser Ala 
370 375 380 

Leu Leu lie Lys Cys Leu Glu Lys Glu Val Ala Ala Leu Cys Arg Tyr 
385 390 395 400 

Thr Pro Arg Arg Asn lie Pro Pro Tyr Phe Val Ala Leu Val Pro Gin 
405 410 415 

Glu Glu Glu Leu Asp Asp Gin Lys lie Gin Val Thr Pro Pro Gly Phe 
420 425 430 

Gin Leu Val Phe Leu Pro Phe Ala Asp Asp Lys Arg Lys Met Pro Phe 
435 440 445 

Thr Glu Lys He Met Ala Thr Pro Glu Gin Val Gly Lys Met Lys Ala 
450 455 460 

He Val Glu Lys Leu Arg Phe Thr Tyr Arg Ser Asp Ser Phe Glu Asn 
465 470 475 480 

Pro Val Leu Gin Gin His Phe Arg Asn Leu Glu Ala Leu Ala Leu Asp 
485 490 495 

Leu Met Glu Pro Glu Gin Ala Val Asp Leu Thr Leu Pro Lys Val Glu 
500 505 510 

Ala Met Asn Lys Arg Leu Gly Ser Leu Val Asp Glu Phe Lys Glu Leu 
515 520 525 

Val Tyr Pro Pro Asp Tyr Asn Pro Glu Gly Lys Val Thr Lys Arg Lys 
530 535 540 

His Asp Asn Glu Gly Ser Gly Ser Lys Arg Pro Lys Val Glu Tyr Ser 
545 550 555 560 

Glu Glu Glu Leu Lys Thr His He Ser Lys Gly Thr Leu Gly Lys Phe 
565 570 575 

Thr Val Pro Met Leu Lys Glu Ala Cys Arg Ala Tyr Gly Leu Lys Ser 
580 585 " 590 

Gly Leu Lys Lys Gin Glu Leu Leu Glu Ala Leu Thr Lys His Phe Gin 
595 600 605 

Asp 



<210> 25 
<211> 477 
<212> PRT 

<213> Arabidopsis thaiiana 
<220> 

<221> SITE 

<222> (1) . . (477) 

<223> /note="KU 70 homologue" 

<400> 25 

Glu Asn Ser Leu Tyr Ser Ala Leu Trp Val Ala Gin Ala Leu Leu Arg 
15 10 15 
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Lys Gly Ser Leu Lys Thr Ala Asp Lys Arg Met Phe Leu Phe Thr Asn 
20 25 30 

Glu Asp Asp Pro Phe Gly Ser Met Arg He Ser Val Lys Glu Asp Met 
35 40 45 

Thr Arg Thr Thr Leu Gin Arg Ala Lys Asp Ala Gin Asp Leu Gly lie 
50 55 60 

Ser He Glu Leu Leu Pro Leu Ser Gin Pro Asp Lys Gin Phe Asn He 
65 70 75 80 

Thr Leu Phe Tyr Lys Asp Leu He Gly Leu Asn Ser Asp Glu Leu Thr 
85 90 95 

Glu Phe Met Pro Ser Val Gly Gin Lys Leu Glu Asp Met Lys Asp Gin 
100 105 * 110 

Leu Lys Lys Arg Val Leu Ala Lys Arg He Ala Lys Arg He Thr Phe 
115 120 125 

Val He Cys Asp Gly Leu Ser He Glu Leu Asn Gly Tyr Ala Leu Leu 
130 135 140 

Arg Pro Ala He Pro Gly Ser lie Thr Trp Leu Asp Ser Thr Thr Asn 
145 150 155 160 

Leu Pro Val Lys Val Glu Arg Ser Tyr He Cys Thr Asp Thr Gly Ala 
165 170 175 

He Met Gin Asp Pro He Gin Arg He Gin Pro Tyr Lys Asn Gin Asn 
180 185 ^ 190 

He Met Phe Thr Val Glu Glu Leu Ser Gin Val Lys Arg He Ser Thr 
195 200 205 

Gly His Leu Arg Leu Leu Gly Phe Lys Pro Leu Ser Cys Leu Lys Asp 
210 215 220 

Tyr His Asn Leu Lvs Pro Ser Thr Phe Leu Tyr Pro Ser Asp Lys Glu 
225 230 235 240 

Val He Gly Ser Thr Arg Ala Phe He Ala Leu His Arg Ser Met He 
245 250 255 

Gin Leu Glu Arg Phe Ala Val Ala Phe Tyr Gly Gly Thr Thr Pro Pro 
260 265 270 

Arg Leu Val Ala Leu Val Ala Gin Asp Glu He Glu Ser Asp Gly Gly 
275 280 285 

Gin Val Glu Pro Pro Gly He Asn Met He Tyr Leu Pro Tyr Ala Asn 
290 295 300 

Asp lie Arg Asp lie Asp Glu Leu His Ser Lys Pro Gly Val Ala Xaa 
305 310 315 * 320 

Pro Arg Ala Ser Asp Asp Gin Leu Lys Lys Ala Ser Ala Leu Met Arg 
325 330 335 

Arg Leu Glu Leu Lys Asp Phe Ser Val Cys Gin Phe Ala Asn Pro Ala 
340 345 350 

Leu Gin Arg His Tyr- Ala He Leu Gin Ala He Ala Leu Asp Glu Asn 
355 360 365 
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Glu Leu Arg Glu Thr Arg Asp Glu Thr Leu Pro Asp Glu Glu Gly Met 
370 375 380 

Asn Arg Pro Ala Val Val Lys Ala lie Glu Gin Phe Lys Gin Ser lie 
385 390 395 400 

Tyr Gly Asp Asp Pro Asp Glu Glu Ser Asp Ser Gly Ala Lys Glu Lys 
405 410 415 

Ser Lys Lys Arg Lys Ala Gly Asp Ala Asp Asp Gly Lys Tyr Asp Tyr 
420 425 430 

lie Glu Leu Ala Lys Thr Gly Lys Leu Lys Asp Leu Thr Val Val Glu 
435 440 445 

Leu Lys Thr Tyr Leu Thr Ala Asn Asn Leu Leu Val Ser Gly Lys Lys 
450 455 460 

Glu Val Leu lie Asn Arg lie Leu Thr His lie Gly Lys 
465 470 475 



<210> 26 
<211> 944 
<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> SITE 
<222> (1) . . (944) 
<223> /nor.e*="LIG 4" 

<400> 26 

Met lie Ser Ala Leu Asp Ser He Pro Glu Pro Gin Asn Phe Ala Pro 
15 10 15 

Ser Pro Asp Phe Lys Trp Leu Cys Glu Glu Leu Phe Val Lys lie His 
20 25 30 

Glu Val Gin lie Asn Gly Thr Ala Gly Thr Gly Lys Ser Arg Ser Phe 
35 40 45 

Lys Tyr Tyr Glu He He Ser Asn Phe Val Glu Met Trp Arg Lys Thr 
50 55 60 

Val Gly Asn Asn He Tyr Pro Ala Leu Val Leu Ala Leu Pro Tyr Arg 
65 70 75 80 

Asp Arg Arg He Tyr Asn He Lys Asp Tyr Val Leu lie Arg Thr He 
85 90 95 

Cys Ser Tyr Leu Lys Leu Pro Lys Asn Ser Ala Thr Glu Gin Arg Leu 
100 105 110 

Lys Asp Trp Lys Gin Arg Val Gly Lys Gly Gly Asn Leu Ser Ser Leu 
115 120 125 

Leu Val Glu Glu He Ala Lys Arg Arg Ala Glu Pro Ser Ser Lys Ala 
130 135 140 

He Thr He Asp Asn Val Asn His Tyr Leu Asp Ser Leu Ser Gly Asp 
145 150 155 160 

Arg Phe Ala Ser Gly Arg Gly Phe Lys Ser Leu Val Lys Ser Lys Pro 
165 170 175 
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Phe Leu His Cys Val Glu Asn Met Ser Phe Val Glu Leu Lys Tyr Phe 
180 185 190 

Phe Asp lie Val Leu Lys Asn Arg Val He Gly Gly Gin Glu His Lys 
195 200 205 

Leu Leu Asn Cys Trp His Pro Asp Ala Gin Asp Tyr Leu Ser Val He 
210 215 220 

Ser Asp Leu Lys Val Val Thr Ser Lys Leu Tyr Asp Pro Lys Val Arg 
225 230 235 240 

Leu Lys Asp Asp Asp Leu Ser lie Lys Val Gly Phe Ala Phe Ala Pro 
245 250 255 

Gin Leu Ala Lys Lys Val Asn Leu Ser Tyr Glu Lys He Cys Arg Thr 
260 265 "* 270 

Leu His Asp Asp Phe Leu Val Glu Glu Lys Met Asp Gly Glu Arg He 
275 280 285 

Gin Val His Tyr Met Asn Tyr Gly Glu Ser He Lys Phe Phe Ser Arg 
290 295 300 

Arg Gly He Asp Tyr Thr Tyr Leu Tyr Gly Ala Ser Leu Ser Ser Gly 
305 310 315 320 

Thr He Ser Gin His Leu Arg Phe Thr Asp Ser Val Lys Glu Cys Val 
325 330 335 

Leu Asp Gly Glu Met Val Thr Phe Asp Ala Lys Arg Arg Val He Leu 
340 345 350 

Pro Phe Gly Leu Val Lys Gly Ser Ala Lys Glu Ala Leu Ser Phe Asn 
355 360 365 

Ser He Asn Asn Val Asp Phe His Pro Leu Tyr Met Val Phe Asp Leu 
370 375 380 

Leu Tyr Leu Asn Gly Thr Ser Leu Thr Pro Leu Pro Leu His Gin Arg 
385 390 395 400 

Lys Gin Tyr Leu Asn Ser He Leu Ser Pro Leu Lys Asn He Val Glu 
405 410 ' 415 

He Val Arg Ser Ser Arg Cys Tyr Gly Val Glu Ser He Lys Lys Ser 
420 425 430 

Leu Glu Val Ala He Ser Leu Gly Ser Glu Gly Val Val Leu Lys Tyr 
435 440 445 

Tyr Asn Ser Ser Tyr Asn Val Ala Ser Arg Asn Asn Asn Trp He Lys 
450 455 460 

Val Lys Pro Glu Tyr Leu Glu Glu Phe Gly Glu Asn Leu Asp Leu He 
465 470 475 480 

Val He Gly Arg Asp Ser Gly Lys Lys Asp Ser Phe Met Leu Glv Leu 
485 490 495 

Leu Val Leu Asp Glu Glu Glu Tyr Lys Lys His Gin Gly Asp Ser Ser 
500 505 510 

Glu He Val Asp His Ser Ser Gin Glu Lys His He Gin Asn Ser Arg 
515 520 525 
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Phe Leu His Cys Val Glu Asn Met Ser Phe Val Glu Leu Lys Tyr Phe 
180 185 190 

Phe Asp lie Val Leu Lys Asn Arg Val lie Gly Gly Gin Glu His Lys 
195 200 205 

Leu Leu Asn Cys Trp His Pro Asp Ala Gin Asp Tyr Leu Ser Val He 
210 215 220 

Ser Asp Leu Lys Val Val Thr Ser Lys Leu Tyr Asp Pro Lys Val Arg 
225 230 ~ 235 240 

Leu Lys Asp Asp Asp Leu Ser He Lys Val Gly Phe Ala Phe Ala Pro 
245 250 255 

Gin Leu Ala Lys Lys Val Asn Leu Ser Tyr Glu Lys He Cys Arg Thr 
260 265 ^ " 270 

Leu His Asp Asp Phe Leu Val Glu Glu Lys Met Asp Gly Glu Arg He 
275 280 285 

Gin Val His Tyr Met Asn Tyr Gly Glu Ser He Lys Phe Phe Ser Arg 
290 295 300 

Arg Gly He Asp Tyr Thr Tyr Leu Tyr Gly Ala Ser Leu Ser Ser Gly 
305 310 315 320 

Thr lie Ser Gin His Leu Arg Phe Thr Asp Ser Val Lys Glu Cys Val 
325 330 335 

Leu Asp Gly Glu Met Val Thr Phe Asp Ala Lys Arg Arg Val lie Leu 
340 345 350 

Pro Phe Gly Leu Val Lys Gly Ser Ala Lys Glu Ala Leu Ser Phe Asn 
355 360 3 65 

Ser He Asn Asn Val Asp Phe His Pro Leu Tyr Met Val Phe Asp Leu 
370 375 380 

Leu Tyr Leu Asn Gly Thr Ser Leu Thr Pro Leu Pro Leu His Gin Arg 
385 390 395 400 

Lys Gin Tyr Leu Asn Ser He Leu Ser Pro Leu Lys Asn lie Val Glu 
405 410 415 

lie Val Arg Ser Ser Arg Cys Tvr Gly Val Glu Ser lie Lys Lys Ser 
420 425 430 

Leu Glu Val Ala He Ser Leu Gly Ser Glu Gly Val Val Leu Lys Tyr 
435 440 445 

Tyr Asn Ser Ser Tyr Asn Val Ala Ser Arg Asn Asn Asn Trp lie Lys 
450 455 460 

Val Lys Pro Glu Tyr Leu Glu Glu Phe Gly Glu Asn Leu Asp Leu He 
465 470 475 480 

Val lie Gly Arg Asp Ser Gly Lys Lys Asp Ser Phe Met Leu Gly Leu 
485 490 495 

Leu Val Leu Asp Glu Glu Glu Tyr Lys Lys His Gin Gly Asp Ser Ser 
500 505 510 

Glu He Val Asp His Ser Ser Gin Glu Lys His lie Gin Asn Ser Arg 
515 520 525 
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Arg Arg Val Lys Lys lie Leu Ser Phe Cys Ser He Ala Asn Gly lie 
"0 535 540 

Ser Gin Glu Glu Phe Lys Glu He Asp Arg Lys Thr Arg Gly His Trp 
545 ' 555 560 

Lys Arg Thr Ser Glu Val Ala Pro Pro Ala Ser He Leu Glu Phe Gly 
565 570 575 

Ser Lys He Pro Ala Glu Trp He Asp Pro Ser Glu Ser He Val Leu 
580 585 590 

Glu He Lys Ser Arg Ser Leu Asp Asn Thr Glu Thr Asn Met Gin Lys 
595 600 605 

Tyr Ala Thr Asn Cys Thr Leu Tyr Gly Gly Tyr Cys Lys Arg He Arg 
610 615 620 

Tyr Asp Lys Glu Trp Thr Asp Cys Tyr Thr Leu Asn Asp Leu Tyr Glu 
62 * 630 635 640 

Ser Arg Thr Val Lys Ser Asn Pro Ser Tyr Gin Ala Glu Arg Ser Gin 
645 650 655 

Leu Gly Leu He Arg Lys Lys Arg Lys Arg Val Leu He Ser Asp Ser 
660 665 670 

Phe His Gin Asn Arg Lys Gin Leu Pro lie Ser Asn lie Phe Ala Gly 
675 680 685 

Leu Leu Phe Tyr Val Leu Ser Asp Tyr Val Thr Glu Asp Thr Gly He 
690 695 700 

Arg He Thr Arg Ala Glu Leu Glu Lys Thr He Val Glu His Gly Glv 
705 710 " 715 720 

Lys Leu He Tyr Asn Val He Leu Lys Arg His Ser He Gly Asp Val 
725 730 735 

Arg Leu He Ser Cys Lys Thr Thr Thr Glu Cys Lys Ala Leu He Asp 
740 745 ~ 750 

Arg Gly Tyr Asp lie Leu His Pro Asn Trp Val Leu Asp Cys He Ala 
755 760 765 

Tyr Lys Arg Leu lie Leu lie Glu Pro Asn Tyr Cys Phe Asn Val Ser 
770 775 780 

Gin Lys Met Arg Ala Val Ala Glu Lys Arg Val Asp Cys Leu Gly Asp 
785 790 795 800 

Ser Phe Glu Asn Asp lie Ser Glu Thr Lys Leu Ser Ser Leu Tyr Lvs 
805 810 815 ' 

Ser Gin Leu Ser Leu Pro Pro Met Gly Glu Leu Glu lie Asp Ser Glu 
820 825 830 

Val Arg Arg Phe Pro Leu Phe Leu Phe Ser Asn Arg He Ala Tyr Val 
835 840 845 

Pro Arg Arg Lys He Ser Thr Glu Asp Asp lie lie Glu Mec Lys lie 
850 855 860 

Lys Leu Phe Gly Gly Lys lie Thr Asp Gin Gin Ser Leu Cys Asn Leu 
863 870 s; 5 880 
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He He He Pro Tyr Thr Asp Pro 
885 

Glu Val His Glu Lys He Lys Glu 
900 

Pro Lys lie Ala Arg Val Val Ala 
915 920 

Asn Glu Asn Cys Gin Val Pro Glu 
930 935 



He Leu Arg Lys Asp Cys Met Asn 
890 ~ - 895 

Gin He Lys Ala Ser Asp Thr He 
905 910 

Pro Glu Trp Val Asp His Ser lie 
925 

Glu Asp Phe Pro Val Val Asn Tyr 
940 



<210> 27 

<211> 844 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (1) . . (844) 

<223> /note="LIG 4 homologue" 

<400> 27 

Met Arg Leu He Leu Pro Gin Leu Glu Arg Glu Arg Met Ala Tyr Gly 
15 10 15 

He Lys Glu Thr Met Leu Ala Lys Leu Tyr He Glu Leu Leu Asn Leu 
20 25 30 

Pro Arg Asp Gly Lys Asp Ala Leu Lys Leu Leu Asn Tyr Arg Thr Pro 
35 40 45 

Thr Gly Thr His Gly Asp Ala Gly Asp Phe Ala Met He Ala Tyr Phe 
50 55 60 

Val Leu Lys Pro Arg Cys Leu Gin Lys Gly Ser Leu Thr He Gin Gin 
65 70 75 80 

Val Asn Asp Leu Leu Asp Ser lie Ala Ser Asn Asn Ser Ala Lys Arg 
85 90 95 

Lys Asp Leu He Lys Lys Ser Leu Leu Gin Leu lie Thr Gin Ser Ser 
100 105 110 

Ala Leu Glu Gin Lys Trp Leu lie Arg Met lie He Lys Asp Leu Lys 
115 120 125 

Leu Gly Val Ser Gin Gin Thr lie Phe Ser Val Phe His Asn Asp Ala 
130 135 140 

Ala Glu Leu His Asn Val Thr Thr Asp Leu Glu Lys Val Cys Arg Gin 
145 150 155 160 

Leu His Asp Pro Ser Val Gly Leu Ser Asp He Ser lie Thr Leu Phe 
165 17C 175 

Ser Ala Ser Lys Pro Met Leu Ala Ala He Ala Asp lie Glu His lie 
180 185 190 

Glu Lys Asp Met Lys His Gin Ser Phe Tyr lie Glu Thr Lvs Leu Asp 
195 200 205 

Gly Glu Arg Met Gin Met His Lys Asp Gly Asp Val Tyr Lys Tyr Phe 
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210 



215 



220 



10 



15 



20 



25 



Ser Arg Asn Gly Tyr Asn Tyr Thr Asp Gin Phe Gly Ala Ser Pro Thr 
225 230 235 240 

Glu Gly Ser Leu Thr Pro Phe He His Asn Ala Phe Lys Ala Asp He 
245 250 255 

Gin He Cys He Leu Asp Gly Glu Met Met Ala Tyr Asn Pro Asn Thr 
260 265 270 

Gin Thr Phe Met Gin Lys Gly Thr Lys Phe Asp He Lys Arg Met Val 
275 280 285 

Glu Asp Ser Asp Leu Gin Thr Cys Tyr Cys Val Phe Asp Val Leu Met 
290 295 300 

Val Asn Asn Lys Lys Leu Gly His Glu Thr Leu Arg Lys Arg Tyr Glu 
305 310 315 320 

He Leu Ser Ser He Phe Thr Pro He Pro Gly Arg lie Glu He Val 
325 330 ^ 335 

Gin Lys Thr Gin Ala His Thr Lys Asn Glu Val He Asp Ala Leu Asn 
340 345 350 

Glu Ala lie Asp Lys Arg Glu Glu Gly He Met Val Lys Gin Pro Leu 
355 360 365 

Ser He Tyr Lys Pro Asp Lys Arg Gly Glu Gly Trp Leu Lys He Lys 
370 375 380 



30 



Pro Glu Tyr Val Ser Gly Leu Met Asp Glu Leu Asp He Leu lie Val 
385 390 395 400 

Gly Gly Tyr Trp Gly Lys Gly Ser Arg Gly Gly Met Met Ser His Phe 
405 410 415 



35 



Leu Cys Ala Val Ala Glu Lys Pro Pro Pro Gly Glu Lys Pro Ser Val 

420 425 430 

Phe His Thr Leu Ser Arg Val Gly Ser Gly Cys Thr Met Lys Glu Leu 
435 440 445 



40 



Tyr Asp Leu Gly Leu Lys Leu Ala Lys Tyr Trp Lys Pro Phe His Arg 

' 450 455 460 

Lys Ala Pro Pro Ser Ser lie Leu Cys Gly Thr Glu Lys Pro Glu Val 
465 470 475 " 480 



45 



Tyr He Glu Pro Cys Asn Ser Val lie Val Gin He Lys Ala Ala Glu 
485 490 495 

He Val Pro Ser Asp Met Tyr Lys Thr Gly Cys Thr Leu Arg Phe Pro 
500 505 * 510 



50 



Arg He Glu Lys He Arg Asp Asp Lys Glu Trp His Glu Cys Met Thr 

515 520 525 

Leu Asp Asp Leu Glu Gin Leu Arg Gly Lys Ala Ser Gly Lvs Leu Ala 

530 535 540 



55 



Ser Lys His Leu Tyr He Gly Gly Asp Asp Glu Pro Gin Glu Lys Lys 
545 550 555 560 

Arg Lys Ala Ala Pro Lys Met Lys Lys Val He Gly He lie Glu His 
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565 570 575 

Leu Lys Ala Pro Asn Leu Thr Asn Val Asn Lys lie Ser Asn lie Phe 
580 585 590 

Glu Asp Val Glu Phe Cys Val Met Ser Gly Thr Asp Ser Gin Pro Lys 
595 600 605 

Pro Asp Leu Glu Asn Arg lie Ala Glu Phe Gly Gly Tyr lie Val Gin 
610 615 620 

Asn Pro Gly Pro Asp Thr Tyr Cys Val He Ala Gly Ser Glu Asn He 
625 630 635 640 

Arg Val Lys Asn He He Leu Ser Asn Lys His Asp Val Val Lys Pro 
645 650 655 

Ala Trp Leu Leu Glu Cys Phe Lys Thr Lys Ser Phe Val Pro Trp Gin 
660 665 670 

Pro Arg Phe Met He His Met Cys Pro Ser Thr Lys Glu His Phe Ala 
675 680 685 

Arg Glu Tyr Asp Cys Tyr Gly Asp Ser Tyr Phe He Asp Thr Asp Leu 
690 695 700 

Asn Gin Leu Lys Glu Val Phe Ser Gly lie Lys Asn Ser Asn Glu Gin 
705 710 715 720 

Thr Pro Glu Glu Met Ala Ser Leu He Ala Asp Leu Glu Tyr Arg Tyr 
725 ' 730 735 

Ser Trp Asp Cys Ser Pro Leu Ser Met Phe Arg Arg His Thr Val Tyr 
740 745 ~ 750 

Leu Asp Ser Tyr Ala Val He Asn Asp Leu Ser Thr Lys Asn Glu Gly 
755 760 765 

Thr Arg Leu Ala lie Lys Ala Leu Glu Leu Arg Phe His Gly Ala Lys 
770 775 780 

Val Val Ser Cys Leu Ala Glu Gly Val Ser His Val He He Gly Glu 
785 790 795 800 

Asp His Ser Arg Val Ala Asp Phe Lys Ala Phe Arg Arg Thr Phe Lys 
805 810 815 

Arg Lys Phe Lys lie Leu Lys Glu Ser Trp Val Thr Asp Ser He Asp 
820 825 830 

Lys Cys Glu Leu Gin Glu Glu Asn Gin Tyr Leu lie 
835 840 



<210> 28 
<211> 1219 
<212> PRT 

<213> Arabidopsis thaliana 
<220> 

<221> SITE 

<222> (1) . . (1219) 

<223> /note="LIG 4 homologue" 

<400> 28 
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Met Thr Glu Glu 
1 



lie Gin Lys Ser 
20 



Phe Leu Asp Thr 
35 



Leu lie He Pro 
50 



Glu Ser Val Leu 
65 



Asp Ala Pro Asp 



Ala Lys Ala Gly 
100 



Val Leu Gin Arg 
115 



Glu Leu Asn Asp 
130 



Glu Lys Thr Leu 
145 



Glu Met Lys Trp 



Met Ser Glu Lys 
180 



Leu Phe Asn Val 
195 



Asp Arg His Gin 
210 



Val Arg Pro Gin 
225 



Lys Lys Leu His 



Asp Arg lie Gin 
260 



Arg Asn Phe Leu 

275 



He Val Gin Asn 
290 



Leu Val Tro Asp 
305 



Gin Glu He Ala 



Leu Cys Tyr Val 
340 



He Lys Phe Ser 
5 



Lys Thr Ser Ser 



Tyr Cys Lys Pro 
40 



Ser Leu Asp Arg 
55 



Ala Thr Cys Leu 
70 



Ala Val Arg Leu 
85 



Ala Asn Ala Gly 



Arg Gin Gly Met 
120 



Leu Leu Asp Arg 
135 



Val Leu Ser Thr 
150 



Val He Arg He 
165 



Ser lie Phe Gin 



Thr Cys Asp Leu 
200 



Arg His Lys Arg 
215 



Leu Ala Met Arg 
230 



Gly Lys Asp Val 
24 5 



He His Lys Asn 



Asp His Ser Glu 
280 



He Leu Val Asp 
2 95 



Thr Ser Leu Asn 
310 



Lys Ala Ala Arg 
325 



Ala Phe Asp Val 



Val Leu Val Ser 
10 



Gin Lys Arg Ser 
25 



Ser Asp. Tyr Phe 



Glu Arg Gly Ser 
60 

He Asp Ala Leu 
75 



Leu Asn Trp Arg 
90 



Asn Phe Ser Leu 
105 



Ala Ser Gly Gly 



Leu Ala Ser Ser 
140 



Leu He Gin Lys 
155 

He Leu Lys Asp 
170 



Glu Phe His Pro 
185 



Lys Leu Val Cys 



Gin Asp He Glu 
220 

He Gly Asp Val 
235 

Val Ala Glu Cys 
250 



Gly Thr Asp He 
265 



Tyr Ala His Ala 



Lys Cys He Leu 
300 



Arg Phe Ala Glu 
315 



Glu Gly Leu Asp 
330 



Leu Tyr Val Gly 
34 5 



Leu Phe Asn Trp 
15 



Lys Phe Arg Lys 
30 



Val Ala Val Arg 
45 



Tyr Gly Leu Lys 



Gly He Ser Arg 
80 



Lys Gly Gly Thr 
95 



He Ala Ala Glu 
110 



Leu Thr He Lys 
125 



Glu Asn Arg Ala 



Thr Asn Ala Gin 
160 

Leu Lys Leu Gly 
175 



Asp Ala Glu Asp 
190 



Glu Lys Leu Arg 
205 



Val Gly Lys Ala 



Asn Ala Ala Trp 
240 



Lys Phe Asp Gly 
255 

His Tyr Phe Ser 
270 



Met Ser Asp Leu 

285 

Asp Gly Glu Met 



Phe Gly Ser Asn 
320 



Ser His Lys Gin 
335 



Asp Thr Ser Val 
350 
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He His Gin Ser Leu Lys Glu Arg His Glu Leu Leu Lys Lys Val Val 
355 360 365 

Lys Pro Leu Lys Gly Arg Leu Glu Val Leu Val Pro Glu Gly Gly Leu 
370 375 380 

Asn Val His Arg Pro Ser Gly Glu Pro Ser Trp Ser He Val Val His 
385 390 395 400 

Ala Ala Ala Asp Val Glu Arg Phe Phe Lys Glu Thr Val Glu Asn Arg 
405 410 415 

Asp Glu Gly lie Val Leu Lys Asp Leu Glu Ser Lys Trp Glu Pro Gly 
420 425 430 

Asp Arg Ser Gly Lys Trp Met Lys Leu Lys Pro Glu Tyr He Arg Ala 
435 440 445 

Gly Ala Asp Leu Asp Val Leu He He Gly Gly Tyr Tyr Gly Ser Gly 
4 50 4 55 4 60 

Arg Arg Gly Gly Glu Val Ala Gin Phe Leu Val Ala Leu Ala Asp Arg 
465 470 475 480 

Ala Glu Ala Asn Val Tyr Pro Arg Arg Phe Met Ser Phe Cys Arg Val 
485 490 495 

Gly Thr Gly Leu Ser Asp Asp Glu Leu Asn Thr Val Val Ser Lys Leu 
500 505 510 

Lys Pro Tyr Phe Arg Lys Asn Glu His Pro Lys Lys Ala Pro Pro Ser 
515 520 525 

Phe Tyr Gin Val Thr Asn His Ser Lys Glu Arg Pro Asp Val Trp He 
530 535 540 

Asp Ser Pro Glu Lys Ser He He Leu Ser He Thr Ser Asp He Arg 
545 550 555 560 

Thr He Arg Ser Glu Val Phe Val Ala Pro Tyr Ser Leu Arg Phe Pro 
565 570 575 

Arg He Asp Lys Val Arg Tyr Asp Lys Pro Trp His Glu Cys Leu Asp 
580 585 590 

Val Gin Ala Phe Val Glu Leu Val Asn Ser Ser Asn Gly Thr Thr Gin 
595 600 605 

Lys Gin Lys Glu Ser Glu Ser Thr Gin Asp Asn Pro Lys Val Asn Lys 
610 615 620 

Ser Ser Lys Arg Gly Glu Lys Lys Asn Val Ser Leu Val Pro Ser Gin 
625 630 635 640 

Phe He Gin Thr Asp Val Ser Asp He Lys Gly Lys Thr Ser He Phe 
645 650 655 

Ser Asn Met He Phe Tyr Phe Val Asn Val Pro Arg Ser His Ser Leu 
660 665 670 

Glu Thr Phe His Lys Met Val Val Glu Asn Gly Gly Lys Phe Ser Met 
675 680 685 

Asn Leu Asn Asn Ser Val Thr His Cys He Ala Ala Glu Ser Ser Gly 
690 695 700 
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He Lys Tyr Gin Ala Ala Lys Arg Gin Arg Asp Val lie His Phe Ser 
705 710 715 720 

Trp Val Leu Asp Cys Cys Ser Arg Asn Lys Met Leu Pro Leu Leu Pro 
"725 730 735 

Lys Tyr Phe Leu His Leu Thr Asp Ala Ser Arg Thr Lys Leu Gin Asp 
740 745 750 

Asp lie Asp Glu Phe Ser Asp Ser Tyr Tyr Trp Asp Leu Asp Leu Glu 
755 760 765 

Gly Leu Lys Gin Val Leu Ser Asn Ala Lys Gin Ser Glu Asp Ser Lys 
770 775 780 

Ser He Asp Tyr Tyr Lys Lys Lys Leu Cys Pro Glu Lys Arg Trp Ser 
785 790 795 800 

Cys Leu Leu Ser Cys Cys Val Tyr Phe Tyr Pro Tyr Ser Gin Thr Leu 
805 810 815 

Ser Thr Glu Glu Glu Ala Leu Leu Gly He Met Ala Lys Arg Leu Met 
820 825 830 

Leu Glu Val Leu Met Ala Gly Gly Lys Val Ser Asn Asn Leu Ala His 
835 840 845 

Ala Ser His Leu Val Val Leu Ala Met Ala Glu Glu Pro Leu Asp Phe 
850 855 860 

Thr Leu Val Ser Lys Ser Phe Ser Glu Met Glu Lys Arg Leu Leu Leu 
865 870 875 880 

Lys Lys Arg Leu His Val Val Ser Ser His Trp Leu Glu Glu Ser Leu 
885 890 895 

Gin Arg Glu Glu Lys Leu Cys Glu Asp Val Tyr Thr Leu Arg Pro Lys 
900 905 " 910 

Tyr Met Glu Glu Ser Asp Thr Glu Glu Ser Asp Lys Ser Glu His Asp 
915 920 925 

Thr Thr Glu Val Ala Ser Gin Gly Ser Ala Gin Thr Lys Glu Pro Ala 
930 935 940 

Ser Ser Lys He Ala He Thr Ser Ser Arg Gly Arg Ser Asn Thr Arg 
945 950 955 ~ 960 

Ala Val Lys Arg Gly Arg Ser Ser Thr Asn Ser Leu Gin Arg Val Gin 
965 970 975 

Arg Arg Arg Gly Lys Gin Pro Ser Lys He Ser Gly Asp Glu Thr Glu 
980 985 990 

Glu Ser Asp Ala Ser Glu Glu Lys Val Ser Thr Arg Leu Ser Asp He 
995 1000 1005 

Ala Glu Glu Thr Asp Ser Phe Gly Glu Ala Gin Arg Asn Ser Ser Arg 
1010 1015 1020 

Gly Lys Cys Aid Lys Arg Gly Lys Ser Arg Val Gly Gin Thr Gin Arg 
!025 1030 1035 1040 

Val Gin Arg Ser Arg Arg Gly Lys Lys Ala Ala Lys He Gly Gly Asp 
1045 1050 1055 
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Glu Ser Asp Glu Asn Asp Glu Leu Asp Gly Asn Asn Asn Val Ser Ala 
1060 1065 1070 

Asp Ala Glu Glu Gly Asn Ala Ala Gly Arg Ser Val Glu Asn Glu Glu 
1075 1080 1085 

Thr Arg Glu Pro Asp lie Ala Lys Tyr Thr Glu Ser Gin Gin Arg Asp 
1090 1095 1100 

Asn Thr Val Ala Val Glu Glu Ala Leu Gin Asp Ser Arg Asn Ala Lys 
1105 1110 H15 ' 1120 

Thr Glu Met Asp Met Lys Glu Lys Leu Gin lie His Glu Asp Pro Leu 
1125 1130 1135 

Gin Ala Met Leu Met Lys Met Phe Pro lie Pro Ser Gin Lys Thr Thr 
1140 1145 1150 

Glu Thr Ser Asn Arg Thr Thr Gly Glu Tyr Arg Lys Ala Asn Val Ser 
1155 1160 1165 

Gly Glu Cys Glu Ser Ser Glu Lys Arg Lys Leu Asp Ala Glu Thr Asp 
1170 1175 1180 

Asn Thr Ser Val Asn Ala Gly Ala Glu Ser Asp Val Val Pro Pro Leu 
1185 1190 1195 1200 

Val Lys Lys Lys Lys Val Ser Tyr Arg Asp Val Ala Gly Glu Leu Leu 
1205 1210 1215 

Lys Asp Trp 



<210> 29 
<211> 692 
<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> SITE 
<222> {1) . . (692) 
<223> /note="MRE 11" 

<400> 29 

Met Asp Tyr Pro Asp Pro Asp Thr He Arg He Leu lie Thr Thr Asp 
15 10 15 

Asn His Val Gly Tyr Asn Glu Asn Asp Pro He Thr Gly Asp Asp Ser 
20 25 30 

Trp Lys Thr Phe His Glu Val Met Met Leu Ala Lys Asn Asn Asn Val 
35 40 45 

Asp Met Val Val Gin Ser Gly Asp Leu Phe His Val Asn Lys Pro Ser 
50 55 60 

Lys Lys Ser Leu Tyr Gin Val Leu Lys Thr Leu Arg Leu Cys Cys Met 
65 70 75 80 

Gly Asp Lys Pro Cys Glu Leu Glu Leu Leu Ser Asp Pro Ser Gin Val 
85 90 95 

Phe His Tyr Asp Glu Phe Thr Asn Val Asn Tyr Glu Asp Pro Asn Phe 
100 105 110 
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Asn lie Ser He Pro Val Phe Giy He Ser Giy Asn His Asp Asp Ala 
115 120 125 

Ser Giy Asp Ser Leu Leu Cys Pro Met Asp He Leu His Ala Thr Giy 
130 135 * 140 

Leu He Asn His Phe Giy Lys Val He Glu Ser Asp Lys He Lys Val 
145 150 155 160 

Val Pro Leu Leu Phe Gin Lys Giy Ser Thr Lys Leu Ala Leu Tyr Giy 
165 170 175 

Leu Ala Ala Val Arg Asp Glu Arg Leu Phe Arg Thr Phe Lys Asp Giy 
180 185 190 

Giy Val Thr Phe Glu Val Pro Thr Met Arg Glu Giy Glu Trp Phe Asn 
195 200 205 

Leu Met Cys Val His Gin Asn His Thr Giy His Thr Asn Thr Ala Phe 
210 215 220 

Leu Pro Glu Gin Phe Leu Pro Asp Phe Leu Asp Met Val He Trp Giy 
225 230 235 240 

His Glu His Glu Cys He Pro Asn Leu Val His Asn Pro He Lys Asn 
245 250 255 

Phe Asp Val Leu Gin Pro Giy Ser Ser Val Ala Thr Ser Leu Cys Glu 
260 265 270 

Ala Glu Ala Gin Pro Lys Tyr Val Phe He Leu Asp lie Lys Tyr Giy 
275 280 285 

Glu Ala Pro Lys Met Thr Pro He Pro Leu Glu Thr He Arg Thr Phe 
290 295 300 

Lys Met Lys Ser He Ser Leu Gin Asp Val Pro His Leu Arg Pro His 
305 310 315 " 320 

Asp Lys Asp Ala Thr Ser Lys Tyr Leu He Glu Gin Val Glu Glu Met 
325 330 335 

He Arg Asp Ala Asn Glu Glu Thr Lys Gin Lys Leu Ala Asp Asp Giy 
340 345 350 

Glu Giy Asp Met Val Ala Glu Leu Pro Lys Pro Leu He Arg Leu Arg 
355 360 365 

Val Asp Tyr Ser Ala Pro Ser Asn Thr Gin Ser Pro He Asp Tyr Gin 
370 375 380 

Val Glu Asn Pro Arg Arg Phe Ser Asn Arg Phe Val Giy Arg Val Ala 
385 390 395 " " 400 

Asn Giy Asn Asn Val Val Gin Phe Tyr Lys Lys Arg Ser Pro Val Thr 
405 410 ' 415 

Arg Ser Lys Lys Ser Giy He Asn Giy Thr Ser He Ser Asp Arg Asp 
420 425 430 

Val Glu Lys Leu Phe Ser Glu Ser Giy Giy Glu Leu Glu Val Gin Thr 
435 440 445 

Leu Val Asn Asp Leu Leu Asn Lys Met Gin Leu Ser Leu Leu Pro Glu 
450 455 460 
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Val Gly Leu Asn Glu Ala Val Lys Lys Phe Val Asp Lys Asp Glu Lys 
465 470 475 480 

Thr Ala Leu Lys Glu Phe lie Ser His Glu lie Ser Asn Glu Val Gly 
4 85 4 90 4 95 

lie Leu Ser Thr Asn Glu Glu Phe Leu Arg Thr Asp Asp Ala Glu Glu 
500 505 510 

Met Lys Ala Leu lie Lys Gin Val Lys Arg Ala Asn Ser Val Arg Pro 
515 520 525 

Thr Pro Pro Lys Glu Asn Asp Glu Thr Asn Phe Ala Phe Asn Gly Asn 
530 535 540 

Gly Leu Asp Ser Phe Arg Ser Ser Asn Arg Glu Val Arg Thr Gly Ser 
545 550 555 560 

Pro Asp lie Thr Gin Ser His Val Asp Asn Glu Ser Arg lie Thr His 
565 570 575 

lie Ser Gin Ala Glu Ser Ser Lys Pro Thr Ser Lys Pro Lys Arg Val 
580 585 590 

Arg Thr Ala Thr Lys Lys Lys lie Pro Ala Phe Ser Asp Ser Thr Val 
595 600 605 

lie Ser Asp Ala Glu Asn Glu Leu Gly Asp Asn Asn Asp Ala Gin Asp 
610 615 620 

Asp Val Asp He Asp Glu Asn Asp He He Met Val Ser Thr Asp Glu 
625 630 635 640 

Glu Asp Ala Ser Tyr Gly Leu Leu Asn Gly Arg Lys Thr Lys Thr Lys 
645 650 655 

Thr Arg Pro Ala Ala Ser Thr Lys Thr Ala Ser Arg Arg Gly Lys Gly 
660 665 670 

Arg Ala Ser Arg Thr Pro Lys Thr Asp He Leu Gly Ser Leu Leu Ala 
675 680 685 

Lys Lys Arg Lys 
690 



<210> 30 
<211> 708 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (1) . . (708) 

<223> /note="MRE 11 homologue" 

<400> 30 

Met Ser Thr Ala Asp Ala Leu Asp Asp Glu Asn Thr Phe Lys He Leu 
15 10 15 

Val Ala Thr Asp He His Leu Gly Phe Met Glu Lys Asp Ala Ala Arg 
20 25 30 

Gly Asn Asp Thr Phe Val Thr Leu Asp Glu He Leu Arg Leu Ala Gin 
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35 40 45 

Glu Asn Glu Val Asp Phe He Leu Leu Gly Gly Asp Leu Phe His Glu 
50 55 60 

Asn Lys Pro Ser Arg Lys Thr Leu His Thr Cys Leu Glu Leu Leu Arg 
65 70 75 80 

Lys Tyr Cys Met Gly Asp Arg Pro Val Gin Phe Glu He Leu Ser Asp 
85 90 95 

Gin Ser Val Asn Phe Gly Phe Ser Lys Phe Pro Trp Val Asn Tyr Gin 
100 105 HO 

Asp Gly Asn Leu Asn He Ser He Pro Val Phe Ser He His Gly Asn 
115 120 125 

His Asp Asp Pro Thr Gly Ala Asp Ala Leu Cys Ala Leu Asp He Leu 
130 135 140 

Ser Cys Ala Gly Phe Val Asn His Phe Gly Arg Ser Met Ser Val Glu 
145 150 155 160 

Lys He Asp He Ser Pro Val Leu Leu Gin Lys Gly Ser Thr Lys He 
165 170 175 

Ala Leu Tyr Gly Leu Gly Ser He Pro Asp Glu Arg Leu Tyr Arg Met 
180 185 190 

Phe Val Asn Lys Lys Val Thr Met Leu Arg Pro Lys Glu Asp Glu Asn 
195 200 205 

Ser Trp Phe Asn Leu Phe Val He His Gin Asn Arg Ser Lys His Gly 
210 215 220 

Ser Thr Asn Phe He Pro Glu Gin Phe Leu Asp Asp Phe He Asp Leu 
225 230 235 240 

Val He Trp Gly His Glu His Glu Cys Lys He Ala Pro Thr Lys Asn 
245 250 255 

Glu Gin Gin Leu Phe Tyr He Ser Gin Pro Gly Ser Ser Val Val Thr 
260 265 270 

Ser Leu Ser Pro Gly Glu Ala Val Lys Lys His Val Gly Leu Leu Arg 
275 280 285 

He Lys Gly Arg Lys Met Asn Met His Lys He Pro Leu His Thr Val 
290 2 95 300 

Arg Gin Phe Phe Met Glu Asp He Val Leu Ala Asn His Pro Asp He 
305 310 315 320 

Phe Asn Pro Asp Asn Pro Lys Val Thr Gin Ala He Gin Ser Phe Cys 
325 330 335 

Leu Glu Lys He Glu Glu Met Leu Glu Asn Ala Glu Arg Glu Arg Leu 
340 345 350 

Gly Asn Ser His Gin Pro Glu Lys Pro Leu Val Arg Leu Arg Val Asp 
355 360 365 

Tyr Ser Gly Gly Phe Glu Pro Phe Ser Val Leu Arg Phe Ser Gin Lys 
370 375 380 

Phe Val Asp Arg Val Ala Asn Pro Lys Asp He He His Phe Phe Arg 
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385 390 395 400 

His Arg Glu Gin Lys Glu Lys Thr Gly Glu Glu lie Asn Phe' Gly Lys 
405 410 415 

Leu He Thr Lys Pro Ser Glu Gly Thr Thr Leu Arg Val Glu Asp Leu 
420 425 430 

Val Lys Gin Tyr Phe Gin Thr Ala Glu Lys Asn Val Gin Leu Ser Leu 
435 440 445 

Leu Thr Glu Arg Gly Met Gly Glu Ala Val Gin Glu Phe Val Asp Lys 
450 455 460 

Glu Glu Lys Asp Ala He Glu Glu Leu Val Lys Tyr Gin Leu Glu Lys 
465 470 475 480 

Thr Gin Arg Phe Leu Lys Glu Arg His He Asp Ala Leu Glu Asp Lys 
485 490 495 

He Asp Glu Glu Val Arg Arg Phe Arg Glu Thr Arg Gin Lys Asn Thr 
500 505 510 

Asn Glu Glu Asp Asp Glu Val Arg Glu Ala Met Thr Arg Ala Arg Ala 
515 520 525 

Leu Arg Ser Gin Ser Glu Glu Ser Ala Ser Ala Phe Ser Ala Asp Asp 
530 535 540 

Leu Met Ser He Asp Leu Ala Glu Gin Met Ala Asn Asp Ser Asp Asp 
545 550 555 560 

Ser He Ser Ala Ala Thr Asn Lys Gly Arg Gly Arg Gly Arg Gly Arg 
565 570 575 

Arg Gly Gly Arg Gly Gin Asn Ser Ala Ser Arg Gly Gly Ser Gin Arg 
580 585 590 

Gly Arg Ala Phe Lys Ser Thr Arg Gin Gin Pro Ser Arg Asn Val Thr 
595 600 605 

Thr Lys Asn Tyr Ser Glu Val He Glu Val Asp Glu Ser Asp Val Glu 
610 615 620 

Glu Asp He Phe Pro Thr Thr Ser Lys Thr Asp Gin Arg Trp Ser Ser 
625 630 635 " 640 

Thr Ser Ser Ser Lys He Met Ser Gin Ser Gin Val Ser Lys Gly Val 
645 650 655 

Asp Phe Glu Ser Ser Glu Asp Asp Asp Asp Asp Pro Phe Met Asn Thr 
660 665 670 

Ser Ser Leu Arg Arg Asn Arg Arg Leu He Tyr Leu Leu Ala Leu Arg 
675 680 * 685 

Asn Met Gin Asp Thr Gly Lys Met Lys Cys Tyr Lys Leu Arg Val Tyr 
690 695 ~ 700 

Ser Leu Arg Phe 
705 



<210> 31 
<211> 720 
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<212> PRT 

<213> Arabidopsis thaliana 
<220> 

<221> SITE 

<222> (1) . . (720) 

<223> /note="MRE 11 homologue" 

<400> 31 

Met Ser Arg Glu Asp Phe Ser Asp Thr Leu Arg Val Leu Val Ala Thr 
15 10 15 

Asp Cys His Leu Gly Tyr Met Glu Lys Asp Glu He Arg Arg His Asp 
20 25 30 

Ser Phe Lys Ala Phe Glu Glu He Cys Ser lie Ala Glu Glu Lys Gin 
35 40 45 

Val Asp Phe Leu Leu Leu Gly ,Gly Asp Leu Phe His Glu Asn Lys Pro 
50 55 60 

Ser Arg Thr Thr Leu Val Lys Ala He Glu He Leu Arg Arg His Cys 
65 70 75 80 

Leu Asn Asp Lys Pro Val Gin Phe Gin Val Val Ser Asp Gin Thr Val 
85 90 95 

Asn Phe Gin Asn Ala Phe Gly Gin Val Asn Tyr Glu Asp Pro His Phe 
100 105 110 

Asn Val Gly Leu Pro Val Phe Ser He His Gly Asn His Asp Asp Pro 
115 120 125 

Ala Gly Val Asp Asn Leu Ser Ala He Asp He Leu Ser Ala Cys Asn 
130 135 140 

Leu Val Asn Tyr Phe Gly Lys Met Val Leu Gly Gly Ser Gly Val Gly 
145 150 155 - 160 

Gin He Thr Leu Tyr Pro He Leu Met Lys Lys Gly Ser Thr Thr Val 
165 170 175 

Ala Leu Tyr Gly Leu Gly Asn He Arg Asp Glu Arg Leu Asn Arg Met 
180 185 190 

Phe Gin Thr Pro His Ala Val Gin Trp Met Arg Pro Glu Val Gin Glu 
195 200 205 

Gly Cys Asp Val Ser Asp Trp Phe Asn lie Leu Val Leu His Gin Asn 
210 215 220 

Arg Val Lys Ser Asn Pro Lys Asn Ala He Ser Glu His Phe Leu Pro 
225 230 235 240 

Arg Phe Leu Asp Phe He Val Trp Gly His Glu His Glu Cys Leu He 
245 250 255 

Asp Pro Gin Glu Val Ser Gly Met Gly Phe His He Thr Gin Pro Gly 
260 265 270 

Ser Ser Val Ala Thr Ser Leu He Asp Gly Glu Ser Lys Pro Lys His 
275 280 285 

Val Leu Leu Leu Glu He Lys Gly Asn Gin Tyr Arg Pro Thr Lys He 
290 295 300 
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Pro Leu Thr Ser Val Arg Pro Phe Glu Tyr Thr Glu lie Val Leu Lys 
305 310 315 320 

Asp Glu Ser Asp He Asp Pro Asn Asp Gin Asn Ser He Leu Glu His 
325 330 335 

Leu Asp Lys Val Val Arg Asn Leu He Glu Lys Ala Ser Lys Lys Ala 
340 345 350 

Val Asn Arg Ser Glu He Lys Leu Pro Leu Val Arg He Lys Val Asp 
. 355 360 365 

Tyr Ser Gly Phe Met Thr He Asn Pro Gin Arg Phe Glv Gin Lys Tyr 
370 375 380 

Val Gly Lys Val Ala Asn Pro Gin Asp He Leu He Phe Ser Lys Ala 
385 390 395 400 

Ser Lys Lys Gly Arg Ser Glu Ala Asn He Asp Asp Ser Glu Arg Leu 
405 410 415 

Arg Pro Glu Glu Leu Asn Gin Gin Asn lie Glu Ala Leu Val Ala Glu 
420 425 430 

Ser Asn Leu Lys Met Glu He Leu Pro Val Asn Asp Leu Asp Val Ala 
435 440 445 

Leu His Asn Phe Val Asn Lys Asp Asp Lys Leu Ala Phe Tyr Ser Cys 
4 50 4 55 " 4 60 

Val Gin Tyr Asn Leu Gin Glu Thr Arg Gly Lys Leu Ala Lys Asp Ser 
465 470 475 480 

Asp Ala Lys Lys Phe Glu Glu Asp Asp Leu He Leu Lys Val Gly Glu 
485 490 495 

Cys Leu Glu Glu Arg Leu Lys Asp Arg Ser Thr Arg Pro Thr Gly Ser 
500 505 510 

Ser Gin Phe Leu Ser Thr Gly Leu Thr Ser Glu Asn Leu Thr Lys Gly 
515 520 525 

Ser Ser Gly He Ala Asn Ala Ser Phe Ser Asp Asp Glu Asp Thr Thr 
530 535 540 

Gin Met Ser Gly Leu Ala Pro Pro Thr Arg Gly Arg Arg Gly Ser Ser 
545 550 555 560 

Thr Ala Asn Thr Thr Arg Gly Arg Ala Lys Ala Pro Thr Arg Gly Arg 
565 570 575 

Gly Arg Gly Lys Ala Ser Ser Ala Met Lys Gin Thr Thr Leu Asp Ser 
580 585 590 

Ser Leu Gly Phe Arg Gin Ser Gin Arg Ser Ala Ser Ala Ala Ala Ser 
595 600 605 

Ala Ala Phe Lys Ser Ala Ser Thr He Gly Glu Asp Asp Val Asp Ser 
610 615 620 

Pro Ser Ser Glu Glu Val Glu Pro Glu Asp Phe Asn Lys Pro Asp Ser 
625 630 635 640 

Ser Ser Glu Asp Asp Glu Ser Thr Lys Gly Lys Gly Arg Lys Arg Pro 
645 65C 655 
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Ala Thr Thr Lys Arg Gly Arg Gly Arg Gly Ser Gly Thr Ser Lys Arg 
660 665 670 

Gly Arg Lys Asn Glu Ser Ser Ser Ser Leu Asn Arg Leu Leu Ser Ser 
675 680 685 

Lys Asp Asp Asp Glu Asp Glu Asp Asp Glu Asp Arg Glu Lys Lys Leu 
690 695 700 

Asn Lys Ser Gin Pro Arg Val Thr Arg Asn Tyr Gly Ala Leu Arg Arg 
705 710 715 720 



<210> 32 
<211> 1312 
<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> SITE 

<222> (1) . . (1312) 

<223> /note="RAD 50" 

<400> 32 

Met Ser Ala lie Tyr Lys Leu Ser lie Gin Gly lie Arg Ser Phe Asp 
15 10 15 

Ser Asn Asp Arg Glu Thr He Glu Phe Gly Lys Pro Leu Thr Leu He 
20 25 ^ 30 

Val Gly Met Asn Gly Ser Gly Lys Thr Thr He He Glu Cys Leu Lys 
35 40 45 

Tyr Ala Thr Thr Gly Asp Leu Pro Pro Asn Ser Lys Gly Gly Val Phe 
50 55 60 ' 

He His Asp Pro Lys He Thr Gly Glu Lys Asp He Arg Ala Gin Val 
65 70 75 80 

Lys Leu Ala Phe Thr Ser Ala Asn Gly Leu Asn Met He Val Thr Arg 
85 " 90 95 

Asn He Gin Leu Leu Met Lys Lys Thr Thr Thr Thr Phe Lys Thr Leu 
100 105 110 

Glu Gly Gin Leu Val Ala He Asn Asn Ser Gly Asp Arg Ser Thr Leu 
115 120 125 

Ser Thr Arg Ser Leu Glu Leu Asp Ala Gin Val Pro Leu Tyr Leu Gly 
130 135 140 

Val Pro Lys Ala He Leu Glu Tyr Val He Phe Cys His Gin Glu Asp 
145 150 155 160 

Ser Leu Trp Pro Leu Ser Glu Pro Ser Asn Leu Lys Lys Lys Phe Asp 
165 170 175 

Glu lie Phe Gin Ala Met Lys Phe Thr Lys Ala Leu Asp Asn Leu Lys 
180 185 190 

Ser He Lys Lys Asp Met Ser Val Asp He Lys Leu Leu Lys Gin Ser 
195 200 ' 205 

Val Glu His Leu Lys Leu Asp Lys Asp Arg Ser Lys Ala Met Lys Leu 
210 215 220 
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Asn lie His Gin Leu Gin Thr Lys He Asp Gin Tyr Asn Glu Glu Val 
225 230 ' 235 240 

Ser Glu He Glu Ser Gin Leu Asn Glu He Thr Glu Lys Ser Asp Lys 
245 250 ' 255 

Leu Phe Lys Ser Asn Gin Asp Phe Gin Lys He Leu Ser Lys Val Glu 
260 265 ' 270 

Asn Leu Lys Asn Thr Lys Leu Ser He Ser Asp Gin Val Lys Arg Leu 
275 280 285 

Ser Asn Ser He Asp He Leu Asp Leu Ser Lys Pro Asp Leu Gin Asn 
290 295 " 300 

Leu Leu Ala Asn Phe Ser Lys Val Leu Met Asp Lys Asn Asn Gin Leu 
305 310 315 320 

Arg Asp Leu Glu Thr Asp He Ser Ser Leu Lys Asp Arg Gin Ser Ser 
325 330 335 

Leu Gin Ser Leu Ser Asn Ser Leu lie Arg Arg Gin Gly Glu Leu Glu 
340 345 350 

Ala Gly Lys Glu Thr Tyr Glu Lys Asn Arg Asn His Leu Ser Ser Leu 
355 360 365 

Lys Glu Ala Phe Gin His Lys Phe Gin Gly Leu Ser Asn He Glu Asn 
370 375 380 

Ser Asp Met Ala Gin Val Asn His Glu Met Ser Gin Phe Lys Ala Phe 
385 390 395 400 

He Ser Gin Asp Leu Thr Asp Thr He Asp Gin Phe Ala Lys Asp He 
405 410 415 

Gin Leu Lys Glu Thr Asn Leu Ser Asp Leu He Lys Ser He Thr Val 
420 425 430 

Asp Ser Gin Asn Leu Glu Tyr Asn Lys Lys Asp Arg Ser Lys Leu He 
435 440 445 

His Asp Ser Glu Glu Leu Ala Glu Lys Leu Lys Ser Phe Lys Ser Leu 
450 455 460 

Ser Thr Gin Asp Ser Leu Asn His Glu Leu Glu Asn Leu Lys Thr Tyr 
465 470 475 480 

Lys Glu Lys Leu Gin Ser Trp Glu Ser Glu Asn He He Pro Lys Leu 
485 490 495 

Asn Gin Lys He Glu Glu Lys Asn Asn Glu Met He He Leu Glu Asn 
500 505 510 

Gin He Glu Lys Phe Gin Asp Arg lie Met Lys Thr Asn Gin Gin Ala 
. 515 520 525 

Asp Leu Tyr Ala Lys Leu Gly Leu lie Lys Lys Ser He Asn Thr Lys 
530 535 540 

Leu Asp Glu Leu Gin Lys He Thr Glu Lys Leu Gin Asn Asp Ser Arg 
545 550 555 560 

He Arg Gin Val Phe Pro Leu Thr Gin Glu Phe Gin Arg Ala Asp Leu 
565 570 575 
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Glu Met Asp Phe Gin Lys Leu Phe He Asn Met Gin Lys Asn He Ala 
580 585 590 

He Asn Asn Lys Lys Met His Glu Leu Asp Arg Arg Tyr Thr Asn Ala 
595 600 605 

Leu Tyr Asn Leu Asn Thr He Glu Lys Asp Leu Gin Asp Asn Gin Lys 
610 615 620 

Ser Lys Glu Lys Val He Gin Leu Leu Ser Glu Asn Leu Pro Glu Asp 
625 630 635 640 

Cys Thr He Asp Glu Tyr Asn Asp Val Leu Glu Glu Thr Glu Leu Ser 
645 650 655 

Tyr Lys Thr Ala Leu Glu Asn Leu Lys Met His Gin Thr Thr Leu Glu 
660 665 670 

Phe Asn Arg Lys Ala Leu Glu He Ala Glu Arg Asp Ser Cys Cys Tyr 
675 680 ' 685 

Leu Cys Ser Arg Lys Phe Glu Asn Glu Ser Phe Lys Ser Lys Leu Leu 
690 695 700 

Gin Glu Leu Lys Thr Lys Thr Asp Ala Asn Phe Glu Lys Thr Leu Lys 
705 710 715 720 

Asp Thr Val Gin Asn Glu Lys Glu Tyr Leu His Ser Leu Arg Leu Leu 
725 730 735 

Glu Lys His He He Thr Leu Asn Ser He Asn Glu Lys He Asp Asn 
740 745 750 

Ser Gin Lys Cys Leu Glu Lys Ala Lys Glu Glu Thr Lys Thr Ser Lys 
755 760 " 765 

Ser Lys Leu Asp Glu Leu Glu Val Asp Ser Thr Lys Leu Lys Asp Glu 
770 775 780 

Lys Glu Leu Ala Glu Ser Glu He Arg Pro Leu lie Glu Lys Phe Thr 
785 790 795 800 

Tyr Leu Glu Lys Glu Leu Lys Asp Leu Glu Asn Ser Ser Lys Thr He 
805 810 815 

Ser Glu Glu Leu Ser He Tyr Asn Thr Ser Glu Asp Gly He Gin Thr 
820 825 ' 830 

Val Asp Glu Leu Arg Asp Gin Gin Arg Lys Met Asn Asp Ser Leu Arg 
835 840 845 

Glu Leu Arg Lys Thr He Ser Asp Leu Gin Met Glu Lys Asp Glu Lys 
850 855 860 

Val Arg Glu Asn Ser Arg Met He Asn Leu He Lys Glu Lys Glu Leu 
865 870 875 880 

Thr Val Ser Glu He Glu Ser Ser Leu Thr Gin Lys Gin Asn lie Asp 
885 890 895 

Asp Ser He Arg Ser Lys Arg Glu Asn lie Asn Asp He Asp Ser Arg 
900 905 910 

Val Lys Glu Leu Glu Ala Arg lie lie Ser Leu Lys Asn Lys Lys Asp 
915 920 925 
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Glu Ala Gin Ser Val Leu Asp Lys Val Lys Asn Glu Arg Asp lie Gin 
930 935 940 

Val Arg Asn Lys Gin Lys Thr Val Ala Asp lie Asn Arg Leu He Asp 
94 5 950 955 960 

Arg Phe Gin Thr He Tyr Asn Glu Val Val Asp Phe Glu Ala Lys Gly 
965 970 975 

Phe Asp Glu Leu Gin Thr Thr He Lys Glu Leu Glu Leu Asn Lys Ala 
980 985 990 

Gin Met Leu Glu Leu Lys Glu Gin Leu Asp Leu Lys Ser Asn Glu Val 
995 1000 * 1005 

Asn Glu Glu Lys Arg Lys Leu Ala Asp Ser Asn Asn Glu Glu Lys Asn 
1010 1015 1020 

Leu Lys Gin Asn Leu Glu Leu He Glu Leu Lys Ser Gin Leu Gin His 
1025 1030 1035 1040 

He Glu Ser Glu He Ser Arg Leu Asp Val Gin Asn Ala Glu Ala Glu 
1045 1050 1055 

Arg Asp Lys Tyr Gin Glu Glu Ser Leu Arg Leu Arg Thr Arg Phe Glu 
1060 1065 1070 

Lys Leu Ser Ser Glu Asn Ala Gly Lys Leu Gly Glu Met Lys Gin Leu 
1075 1080 " 1085 

Gin Asn Gin He Asp Ser Leu Thr His Gin Leu Arg Thr Asp Tyr Lys 
1090 1095 1100 

Asp He Glu Lys Asn Tyr His Lys Glu Trp Val Glu Leu Gin Thr Arg 
1105 1110 1115 1120 

Ser Phe Val Thr Asp Asp He Asp Val Tyr Ser Lys Ala Leu Asp Ser 
1125 1130 1135 

Ala He Met Lys Tyr His Gly Leu Lys Met Gin Asp He Asn Arg lie 
1140 1145 1150 

He Asp Glu Leu Trp Lys Arg Thr Tyr Ser Gly Thr Asp He Asp Thr 
1155 1160 1165 

lie Lys He Arg Ser Asp Glu Val Ser Ser Thr Val Lys Gly Lys Ser 
1170 1175 1180 

Tyr Asn Tyr Arg Val Val Met Tyr Lys Gin Asp Val Glu Leu Asp Met 
1185 H90 1195 1200 

Arg Gly Arg Cys Ser Ala Gly Gin Lys Val Leu Ala Ser He He lie 
1205 1210 1215 

Arg Leu Ala Leu Ser Glu Thr Phe Gly Ala Asn Cys Gly Val He Ala 
1220 1225 1230 

Leu Asp Glu Pro Thr Thr Asn Leu Asp Glu Glu Asn He Glu Ser Leu 
1235 1240 1245 

Ala Lys Ser Leu His Asn lie lie Asn Met Arg Arg His Gin Lys Asn 
1250 1255 1260 

Phe Gin Leu lie Val lie Thr His Asp Glu Lys Phe Leu Gly His Met 
1265 1270 1275 1280 
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Asn Ala Ala Ala Phe Thr Asp His Phe Phe Lys Val Lys Arg Asp Asp 
1285 1290 1295 

Arg Gin Lys Ser Gin He Glu Trp Val Asp lie Asn Arg Val Thr Tyr 
1300 1305 1310 
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<210> 33 
<211> 1318 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (1)..(1318) 

<223> /note="RAD 50 homologue" 

<400> 33 

Met Leu He Phe Ser Val Arg Asp Met Phe Ala Lys Met Ser He Leu 
15 10 15 

Gly Val Arg Ser Phe Gly He Glu Asp Lys Asp Lys Gin He He Thr 
20 25 30 

Phe Phe Ser Pro Leu Thr He Leu Val Gly Pro Asn Gly Ala Gly Lys 
35 40 45 

Thr Thr He He Glu Cys Leu Lys Tyr He Cys Thr Gly Asp Phe Pro 
50 55 60 

Pro Gly Thr Lys Gly Asn Thr Phe Val His Asp Pro Lys Val Ala Gin 
€5 70 75 80 

Glu Thr Asp Val Arg Ala Gin He Arg Leu Gin Phe Arg Asp Val Asn 
85 90 95 

Gly Glu Leu He Ala Val Gin Arg Ser Met Val Cys Thr Gin Lys Ser 
100 105 110 

Lys Lys Thr Glu Phe Lys Thr Leu Glu Gly Val He Thr Arg Thr Lys 
115 120 125 

His Gly Glu Lys Val Ser Leu Ser Ser Lys Cys Ala Glu He Asp Arg 
130 135 140 

Glu Met He Ser Ser Leu Gly Val Ser Lys Ala Val Leu Asn Asn Val 
145 150 155 160 

He Phe Cys His Gin Glu Asp Ser Asn Trp Pro Leu Ser Glu Gly Lys 
165 170 175 

Ala Leu Lys Gin Lys Phe Asp Glu He Phe Ser Ala Thr Arg Tyr He 
180 185 190 

Lys Ala Leu Glu Thr Leu Arg Gin Val Arg Gin Thr Gin Gly Gin Lys 
195 200 205 

Val Glu Glu Tyr Gin Met Glu Leu Lys Tyr Leu Lys Gin Tyr Lys Glu 
210 215 220 

Lys Ala Cys Glu lie Arg Asp Gin He Thr Ser Lys Glu Ala Gin Leu 
225 230 235 240 

Thr Ser Ser Lys Glu He Val Lys Ser Tyr Glu Asn Glu Leu Asp Pro 
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245 250 255 

Leu Lys Asn Arg Leu Lys Glu lie Glu His Asn Leu Ser Lys lie Met 
260 265 270 

Lys Leu Asp Asn Glu He Lys Ala Leu Asp Ser Arg Lys Lys Gin Met 
275 280 285 

Glu Lys Asp Asn Ser Glu Leu Glu Glu Lys Met Glu Lys Val Phe Gin 
290 295 300 

Gly Thr Asp Glu Gin Leu Asn Asp Leu Tyr His Asn His Gin Arg Thr 
305 310 315 320 

Val Arg Glu Lys Glu Arg Lys Leu Val Asp Cys His Arg Glu Leu Glu 
325 330 335 

Lys Leu Asn Lys Glu Ser Arg Leu Leu Asn Gin Glu Lys Ser Glu Leu 
340 345 * 350 

Leu Val Glu Gin Gly Arg Leu Gin Leu Gin Ala Asp Arg His Gin Glu 
355 360 365 

His lie Arg Ala Arg Asp Ser Leu He Gin Ser Leu Ala Thr Gin Leu 
370 375 380 

Glu Leu Asp Gly Phe Glu Arg Gly Pro Phe Ser Glu Arg Gin He Lys 
385 390 395 400 

Asn Phe His Lys Leu Val Arg Glu Arg Gin Glu Gly Glu Ala Lys Thr 
405 410 415 

Ala Asn Gin Leu Met Asn Asp Phe Ala Glu Lys Glu Thr Leu Lys Gin 
420 425 430 

Lys Gin He Asp Glu He Arg Asp Lys Lys Thr Gly Leu Gly Arg lie 
435 440 445 

He Glu Leu Lys Ser Glu He Leu Ser Lys Lys Gin Asn Glu Leu Lys 
450 455 460 

Asn Val Lys Tyr Glu Leu Gin Gin Leu Glu Gly Ser Ser Asp Arg He 
465 470 475 .480 

Leu Glu Leu Asp Gin Glu Leu He Lys Ala Glu Arg Glu Leu Ser Lvs 
485 * 490 495 

Ala Glu Lys Asn Ser Asn Val Glu Thr Leu Lys Met Glu Val He Ser 
500 505 510 

Leu Gin Asn Glu Lys Ala Asp Leu Asp Arg Thr Leu Arg Lys Leu Asp 
515 520 525 

Gin Glu Met Glu Gin Leu Asn His His Thr Thr Thr Arg Thr Gin Met 
530 535 540 

Glu Met Leu Thr Lys Asp Lys Ala Asp Lys Asp Glu Gin He Arg Lys 
545 550 555 560 

He Lys Ser Arg His Ser Asp Glu Leu Thr Ser Leu Leu Gly Tyr Phe 
565 570 575 

Pre Asn Lys Lys Gin Leu Glu Asp Trp Leu His Ser Lys Ser Lys Glu 
580 585 590 

He Asn Gin Thr Arg Asp Arg Leu Ala Lys Leu Asn Lys Glu Leu AJ a 
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595 600 605 

Ser Ser Glu Gin Asn Lys Asn His lie Asn Asn Glu Leu Glu Arg Lys 
610 615 620 

Glu Glu Gin Leu Ser Ser Tyr Glu Asp Lys Leu Phe Asp Val Cys Gly 
62 5 630 635 640 

Ser Gin Asp Phe Glu Ser Asp Leu Asp Arg Leu Lys Glu Glu lie Glu 
645 650 655 

Lys Ser Ser Lys Gin Arg Ala Met Leu Ala Gly Ala Thr Ala Val Tyr 
660 665 670 

Ser Gin Phe He Thr Gin Leu Thr Asp Glu Asn Gin Ser Cys Cys Pro 
675 680 685 

Val Cys Gin Arg Val Phe Gin Thr Glu Ala Glu Leu Gin Glu Ala He 
690 695 700 

Ser Asp Leu Gin Ser Lys Leu Arg Leu Ala Pro Asp Lys Leu Lys Ser 
705 710 715 720 

Thr Glu Ser Glu Leu Lys Lys Lys Glu Lys Arg Arg Asp Glu Met Leu 
725 730 735 

Gly Leu Ala Pro Met Arg Gin Ser He He Asp Leu Lys Glu Lys Glu 
740 745 * 750 

He Pro Glu Leu Arg Asn Lys Leu Gin Asn Val Asn Arg Asp lie Gin 
755 760 765 

Arg Leu Lys Asn Asp He Glu Glu Gin Glu Thr Leu Leu Gly Thr He 
770 775 780 

Met Pro Glu Glu Glu Ser Ala Lys Val Cys Leu Thr Asp Val Thr He 
785 790 795 800 

Met Glu Arg Phe Gin Met Glu Leu Lys Asp Val Glu Arg Lys He Ala 
805 810 815 

Gin Gin Ala Ala Lys Leu Gin Gly He Asp Leu Asp Arg Thr Val Gin 
820 825 830 

Gin Val Asn Gin Glu Lys Gin Glu Lys Gin His Lys Leu Asp Thr Val 
835 840 845 

Ser Ser Lys He Glu Leu Asn Arg Lys Leu He Gin Asp Gin Gin Glu 
850 855 860 

Gin He Gin His Leu Lys Ser Thr Thr Asn Glu Leu Lys Ser Glu Lys 
8 65 870 875 880 

Leu Gin He Ser Thr Asn Leu Gin Arg Arg Gin Gin Leu Glu Glu Gin 
885 890 895 

Thr Val Glu Leu Ser Thr Glu Val Gin Ser Leu Tyr Arg Glu He Lys 
900 905 " 910 

Asp Ala Lys Glu Gin Val Ser Pro Leu Glu Thr Thr Leu Glu Lys Phe 
915 920 925 

Gin Gin Glu Lys Glu Glu Leu He Asn Lys Lys Asn Thr Ser Asn Lys 
930 935 940 

He Ala Gin Asp Lys Leu Asn Asp He Lys Glu Lys Val Lys Asn He 
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945 950 955 960 

His Gly Tyr Met Lys Asp He Glu Asn His lie Gin Asp Gly Lys Asp 
965 970 975 

Asp Tyr Met Lys Gin Lys Glu Thr Glu Leu Asn Lys Val He Ala Gin 
980 985 990 

Leu Ser Glu Cys Glu Lys His Lys Glu Lys He Asn Glu Asp Met Arg 
995 1000 1005 

Leu Met Arg Gin Asp He Asp Thr Gin Lys He Gin Glu Arg Trp Leu 
1010 1015 ~ 1020 

Gin Asp Asn Leu Thr Leu Arg Lys Arg Asn Glu Glu Leu Lys Glu Val 
1025 1030 1035 1040 

Glu Glu Glu Gly Lys Gin His Leu Lys Glu Met Gly Gin Met Gin Val 
1045 1050 ~ 1055 

Leu Gin Met Lys Ser Glu His Gin Lys Leu Glu Glu Asn He Asp Asn 
1060 1065 1070 

He Lys Arg Asn His Asn Leu Ala Leu Gly Arg Gin Lys Gly Tyr Glu 
1075 1080 " 1085 

Glu Glu He He His Phe Lys Lys Glu Leu Arg Glu Pro Gin Phe Arg 
1090 1095 1100 

Asp Ala Glu Glu Lys Tyr Arg Glu Met Met He Val Met Arg Thr Thr 
1105 1110 1115 " 1120 

Glu Leu Val Asn Lys Asp Leu Asp He Tyr Tyr Lys Thr Leu Asp Gin 
1125 1130 1135 

Ala He Met Lys Phe His Ser Met Lys Met Glu Glu He Asn Lys He 
1140 1145 1150 

He Arg Asp Leu Trp Arg Ser Thr Tyr Arg Gly Gin Asp He Glu Tyr 
1155 1160 1165 

He Glu He Arg Ser Asp Ala Asp Glu Asn Val Ser Ala Ser Asp Lys 
1170 1175 1180 

Arg Arg Asn Tyr Asn Tyr Arg Val Val Met Leu Lys Gly Asp Thr Ala 
1185 1190 1195 1200 

Leu Asp Met Arg Gly Arg Cys Ser Ala Gly Gin Lys Val Leu Ala Ser 
1205 1210 1215 

Leu lie He Arg Leu Ala Leu Ala Glu Thr Phe Cys Leu Asn Cys Gly 
1220 1225 1230 

He lie Ala Leu Asp Glu Pro Thr Thr Asn Leu Asp Arg Glu Asn He 
1235 1240 1245 

Glu Ser Leu Ala His Ala Leu Val Glu He He Lys Ser Arg Ser Gin 
1250 1255 1260 

Gin Arg Asn Phe Gin Leu Leu Val He Thr His Asp Glu Asp Phe Val 
1265 1270 1275 1280 

Glu Leu Leu Gly Arg Ser Glu Tyr Val Glu Lys Phe Tyr Arg He Lys 
1285 1290 1295 

Lys Asn He Asp Gin Cys Ser Glu He Val Lys Cys Ser Val Ser Ser 
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1300 1305 1310 



Leu Gly Phe Asn Val His 
1315 



<210> 34 

<211> 1292 

<212> PRT 

<213> Arabidopsis thaliana 
<220> 

<221> SITE 

<222> (1) . . (1292) 

<223> /note="RAD 50 homologue" 

<400> 34 

Met Ser Thr Val Asp Lys Met Leu lie Lys Gly He Arg Ser Phe Asp 
15 10 15 

Pro Glu Asn Lys Asn Val Val Thr Phe Phe Arg Pro Leu Thr Leu He 
20 25 30 

Val Gly Ala Asn Gly Ala Gly Lys Thr Thr He He Glu Cys Leu Lys 
35 40 45 

Val Ser Cys Thr Gly Glu Leu Pro Pro Asn Ala Arg Ser Gly His Ser 
50 55 60 

Phe He His Asp Pro Lys Val Ala Gly Glu Thr Glu Thr Lys Ala Gin 
65 70 75 80 

He Lys Leu Arg Phe Lys Thr Ala Ala Gly Lys Asp Val Val Cys He 
85 90 95 

Arg Ser Phe Gin Leu Thr Gin Lys Ala Ser Lys Met Glu Tyr Lys Ala 
100 105 110 

He Glu Ser Val Leu Gin Thr He Asn Pro His Thr Gly Glu Lys Val 
115 120 125 

Cys Leu Ser Tyr Arg Cys Ala Asp Met Asp Arg Glu He Pro Ala Leu 
130 135 ~ 140 

Met Gly Val Ser Lys Ala He Leu Glu Asn Val He Phe Val His Gin 
145 150 155 160 

Asp Glu Ser Asn Trp Pro Leu Gin Asp Pro Ser Thr Leu Lys Lys Lys 
165 170 175 

Fhe Asp Asp He Phe Ser Ala Thr Arg Tyr Thr Lys Ala Leu Glu Val 
180 185 190 

He Lys Lys Leu His Lys Asp Gin Ala Gin Glu He Lys Thr Phe Lys 
195 200 205 

Leu Lys Leu Glu Asn Leu Gin Thr Leu Lys Asp Ala Ala Tyr Lys Leu 
210 215 ~ 220 

Arg Glu Ser He Ala Gin Asp Gin Glu Arg Thr Glu Ser Ser Lys Val 
225 230 235 240 

Gin Met Leu Glu Leu Glu Thr Ser Val Gin Lys Val Asp Ala Glu Val 
245 250 255 
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His Asn Lys Glu Met Met Leu Lys Asp Leu Arg Lys Leu Gin Asp Gin 
260 265 270 

Val Ser He Lys Thr Ala Glu Arg Ser Thr Leu Phe Lys Glu Gin Gin 
275 280 285 

Arg Gin Tyr Ala Ala Leu Pro Glu Glu Asn Glu Asp Thr lie Glu Glu 
290 295 300 

Leu Lys Glu Trp Lys Ser Lys Phe Glu Glu Arg Leu Ala Leu Leu Gly 
305 310 315 320 

Thr Lys He Arg Lys Met Glu Arg Glu Met Val Asp Thr Glu Thr Thr 
325 330 335 

He Ser Ser Leu His Asn Ala Lys Thr Asn Tyr Met Leu Glu He Ser 
340 345 350 

Lys Leu Gin Thr Glu Ala Glu Ala His Met Leu Leu Lys Asn Glu Arg 
355 360 365 

Asp Ser Thr He Gin Asn He Phe Phe His Tyr Asn Leu Gly Asn Val 
370 375 380 

Pro Ser Thr Pro Phe Ser Thr Glu Val Val Leu Asn Leu Thr Asn Arg 
385 390 395 400 

He Lys Ser Arg Leu Gly Glu Leu Glu Met Asp Leu Leu Asp Lys Lys 
405 410 415 

Lys Ser Asn Glu Thr Ala Leu Ser Thr Ala Trp Asp Cys Tyr Met Asp 
420 425 430 

Ala Asn Asp Arg Trp Lys Ser He Glu Ala Gin Lys Arg Ala Lys Asp 
435 440 445 

Glu He Lys Met Gly He Ser Lys Arg lie Glu Glu Lys Glu He Glu 
450 455 460 

Arg Asp Ser Phe Glu Phe Glu He Ser Thr Val Asp Val Lys Gin Thr 
465 470 475 480 

Asp Glu Arg Glu Lys Gin Val Gin Val Glu Leu Glu Arg Lys Thr Lys 
485 490 495 

Gin Asn Ser Glu Arg Gly Phe Glu Ser Lys He Glu Gin Lys Gin His 
500 505 510 

Glu He Tyr Ser Leu Glu His Lys He Lys Thr Leu Asn Arg Glu Arg 
515 520 525 

Asp Val Met Ala Gly Asp Ala Glu Asp Arg Leu Leu Thr Arg He Asp 
530 535 540 

Glu Cys Lys Asp Arg He Arg Gly Val Leu Lys Gly Arg Leu Pro Pro 
545 550 555 560 

Glu Lys Asp Met Lys Arg Glu He Val Gin Ala Leu Arg Ser He Glu 
565 570 575 

Arg Glu Tyr Asp Asp Leu Ser Leu Lys Ser Arg Glu Ala Glu Lys Glu 
580 585 590 

Val Asn Met Leu Gin Met Lys He Gin Glu Val Asn Asn Ser Leu Phe 
595 600 605 
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Lys His Asn Lys Asp Thr Glu Ser Arg Lys Arg Tyr lie Glu Ser Lys 
610 615 620 

Leu Gin Ala Leu Lys Gin Glu Ser Val Thr He Asp Ala Tyr Pro Lys 
625 630 635 640 

Leu Leu Glu Ser Ala Lys Asp Lys Arg Asp Asp Arg Lys Arg Glu Tyr 
645 650 655 

Asn Met Ala Asn Gly Met Arg Gin Met Phe Glu Pro Phe Glu Lys Arg 
660 665 670 

Ala Arg Gin Glu His Ser Cys Pro Cys Cys Glu Arg Ser Phe Thr Ala 
675 680 685 

Asp Glu Glu Ala Ser Phe He Lys Lys Gin Arg Val Lys Ala Ser Ser 
690 695 " 700 

Thr Gly Glu His Leu Lys Ala Leu Ala Val Glu Ser Ser Asn Ala Asp 
7 05 710 715 720 

Ser Val Phe Gin Gin Leu Asp Lys Leu Arg Ala Val Phe Glu Glu Tyr 

725 730 735 

Ser Lys Leu Thr Thr Glu He He Pro Leu Ala Glu Lys Thr Leu Gin 
740 745 750 

Glu His Thr Glu Glu Leu Gly Gin Lys Ser Glu Ala Leu Asp Asp Val 
755 760 765 

Leu Gly He Ser Ala Gin He Lys Ala Asp Lys Asp Ser He Glu Ala 
770 775 780 

Leu Val Gin Pro Leu Glu Asn Ala Asp Arg He Phe Gin Glu He Val 
785 790 795 800 

Ser Tyr Gin Lys Gin He Glu Asp Leu Glu Tyr Lys Leu Asp Phe Arg 
805 810 815 

Gly Leu Gly Val Lys Thr Met Glu Glu He Gin Ser Glu Leu Ser Ser 
820 825 830 

Leu Gin Ser Ser Lys Asp Lys Leu His Gly Glu Leu Glu Lys Leu Arg 
835 840 845 

Asp Asp Gin He Tyr Met Glu Arg Asp He Ser Cys Leu Gin Ala Arg 
850 855 860 

Trp His Ala Val Arg Glu Glu Lys Ala Lys Ala Ala Asn Leu Leu Arg 
865 870 875 880 

Asp Val Thr Lys Ala Glu Glu Asp Leu Glu Arg Leu Ala Glu Glu Lys 
885 890 895 

Ser Gin Leu Asp Leu Asp Val Lys Tyr Leu Thr Glu Ala Leu Gly Pro 
900 905 910 

Leu Ser Lys Glu Lys Glu Gin Leu Leu Ser Asp Tyr Asn Asp Met Lys 
915 920 925 

He Arg Arg Asn Gin Glu Tyr Glu Glu Leu Ala Glu Lys Lys Arg Asn 
930 935 94 0 

Tyr Gin Gin Glu Val Glu Ala Leu Leu Lys Ala Ser Tyr Lys He Asn 
945 950 955 960 
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Asp Cys Phe Thr Arg Tyr His Asp Leu Lys Lys Gly Glu Arg Leu Asp 
965 970 J 975 

Asp lie Gin Glu Lys Gin Arg Leu Ser Asp Ser Gin Leu Gin Ser Cys 
980 985 990 

Glu Ala Arg Lys Asn Glu Leu Ala Gly Glu Leu Asn Arg Asn Lys Asp 
995 1000 1005 

Leu Met Arg Asn Gin Asp Gin Leu Arg Arg Asn lie Glu Asp Asn Leu 
1010 1015 1020 

Asn Tyr Arg Thr Thr Lys Ala Lys Val Glu Glu Leu Thr Arg Glu He 
1025 1030 1035 1040 

Glu Ser Leu Glu Glu Gin He Leu Asn He Gly Gly He Ala Ala Val 
1045 1050 1055 

Glu Ala Glu He Val Lys He Leu Arg Glu Arg Glu Arg Leu Leu Ser 
1060 1065 1070 

Glu Leu Asn Arg Cys Arg Gly Thr Val Ser Val Tyr Glu Ser Ser He 
1075 1080 1085 

Ser Lys Asn Arg Val Glu Leu Lys Gin Ala Gin Tyr Lys Asp He Asp 
1090 1095 1100 

Lys Arg His Phe Asp Gin Leu He Gin Leu Lys Thr Thr Glu Met Ala 
1105 1110 1115 1120 

Asn Lys Asp Leu Asp Arg Tyr Tyr Asn Ala Leu Asp Lys Ala Leu Met 
1125 " 1130 1135 

Arg Phe His Thr Met Lys Met Glu Glu He Asn Lys He He Arg Glu 
1140 1145 1150 

Leu Trp Gin Gin Thr Tyr Arg Gly Gin Asp Met Asp Tyr He Arg He 
1155 1160 1165 

His Ser Asp Ser Glu Gly Ala Gly Thr Arg Ser Tyr Ser Tyr Lys Val 
1170 1175 1180 

Leu Met Gin Thr Gly Asp Thr Glu Leu Glu Met Arg Gly Arg Cys Ser 
1185 1190 1195 1200 

Ala Gly Gin Lys Val Leu Ala Ser Leu He He Arg Leu Ala Leu Ala 
1205 1210 ~ 1215 

Glu Thr Phe Cys Leu Asn Cys Gly He Leu Ala Leu Asp Glu Pro Thr 
1220 1225 1230 

Thr Asn Leu Asp Gly Pro Asn Ser Glu Ser Leu Ala Gly Ala Leu Leu 
1235 1240 1245 

Arg He Met Glu Asp Arg Lys Gly Gin Glu Asn Phe Gin Leu lie Val 
1250 1255 1260 

He Thr His Asp Glu Arg Phe Ala Gin Met He Gly Gin Arg Gin His 
1265 1270 1275 1280 

Ala Glu Lys Tyr Tyr Arg Val Ala Lys Asp Asp Met 
1285 " 1290 
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Claims 



1 . A method to direct integration of a nucleic acid of interest to a pre-determined site, whereby said nucleic acid has 
homology at or around the said pre-determined site, in an eukaryote with a preference for non-homologous re- 

5 combination, comprising steering an integration pathway towards homologous recombination. 

2. A method to direct nucleic acid integration according to claim 1, comprising providing a mutant of a component 
involved in non-homologous recombination. 

10 3. A method to direct nucleic acid integration according to claim 1 or 2, comprising inhibiting a component involved 
in non-homologous recombination. 

4. A method according to claim 2 or 3 wherein said component involved in non-homologous recombination comprises 
/a/70, radSO, mre11 s xrs2, Iig4 or sir4. 

15 

5. A method to direct integration of a nucleic acid of interest to a subtelomeric and/or telomeric region in an eukaryote 
with a preference for non-homologous recombination by providing a mutant of a component involved in non-ho- 
mologous recombination. 

20 6. A method to direct integration of a nucleic acid of interest to a subtelomeric and/or telomeric region in an eukaryote 
with a preference for non-homologous recombination, comprising inhibiting a component involved in non-homol- 
ogous recombination 

7. A method to direct integration according to claim 5 or 6 wherein said component involved in non-homologous 
25 recombination comprises radSO, mrel 1 or xrs2. 

8. A method according to anyone of claims 1 to 7 wherein said eukaryote comprises yeast. 

9. A method according to anyone of claims 1-8 comprising transiently inhibiting integration via non-homologous re- 
30 combination. 



10. A method according claim 9 wherein said transiently inhibiting is provided by an Agrobacterium Vir-fusion protein 
capable of inhibiting a component involved in non-homologous recombination. 

35 11. A method to direct nucleic acid integration according to claim 10 wherein said Agrobacterium Vir fusion protein 
comprises VirF or VirE2. 

12. A method according to claim 10 or 11 wherein said component involved in non-homologous recombination com- 
prises ku70, rad50, mrel 7, xrs2, Iig4 or sir4. 

40 

13. A method according to anyone of the aforegoing claims wherein said nucleic acid of interest comprises an inactive 
gene to replace an active gene. 

14. A method according to anyone of claims 1-12, wherein said nucleic acid of interest comprises an active gene to 
45 replace an inactive gene. 

15. A method according to anyone of claims 1 -1 2, wherein said nucleic acid of interest encodes a therapeutic protein- 
aceous substance. 



50 16. A method according to anyone of claims 1-12, wherein said nucleic acid of interest encodes a substance conferring 
resistance for an antibiotic substance to a cell. 



17. A method according to anyone of claims 1-12, wherein said nucleic acid of interest confers a desired property to 
said eukaryotic cell. 

55 

18. A method according to anyone of the aforegoing claims wherein said nucleic acid of interest is part of a gene 
delivery vehicle. 
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19. Use of a method according to anyone of claims 1 to 18 for improvement of gene targeting efficiency. 
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